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98 Human Secreted Proteins 

Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, numan cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
organelle, contains different proteins essential for the function of the organelle. The '. 
cell uses "sorting signals." which are amino acid motifs located within the protein, to 
target proteins to particular cellular organelles. . ' 

One type of sorting signal, called a signal sequence, a signal peptide, or a 
leader sequence, directs a class of proteins to an organelle called the endoplasmic 
1 reticulum (ER). The ER separates the membrane-bounded proteins from all other 
types' of proteins. Once localized to the ER, both groups of proteins can be further 
directed to another organelle called the Golgi apparatus. Here, the Golgi distributes 
the proteins to vesicles, including secretory vesicles, the cell membrane, lysosomes, 
arid the other organelles. 

s 4 Proteins targeted to'the ER by a signal sequence can be released into the 
extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the 
extracellular space - a process called exocytosis. Exocytosis can occur constitutively 
or after receipt of a triggering signal. In the latter case, the proteins are stored in 
secretory vesicles (or secretory granules) until exocytosis is triggered. Similarly, 
proteins residing on the cell membrane can also be secreted into the extracellular 
space by proteolytic cleavage of a "linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These. secreted proteins . 
include the commercially valuable human insulin; interferon. Factor VIII, human 
growth hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of 
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the pervasive role of secreted proteins in human physiology, a need exists for. \ 
identifying and characterizing novel human secreted proteins and the genes that ! 
encode them, this knowledge will allow one to detect, to treat, and to prevent 
medical disorders by using secreted proteins or the genes that encode them. 

Summary of the Invention , 

The present invention relates to novel polynucleotides 'and the encoded * ' ■ ' 
polypeptides. Moreover, the present invention relates to vectors, host cells, 
antibodies, and recombinant methods for producing the polypeptides and 
polynucleotides. Also provided are diagnostic methods for detecting disorders related 
to the polypeptides, and therapeutic methods for treating such disorders. The 
invention further relates to screening methods for. identifying binding partners of the 
polypeptides. 

Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
J terms used throughout this specification. ' * . ' 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example /an isolated t 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular eel 1 is not the original environment of the polynucleotide. 

In the present invention, a "secreted" protein refers to those proteins capable v 
of being directed to (he ER, secretory vesicles, or the extracellular space as a result of 
a signal sequence, as wel! as those proteins released into the extracellular space , 
without necessarily containing a signal sequence. If the secreted protein is released* 
into the extracellular space, the secreted protein can undergo extracellular processing 
to produce a "mature" protein. Release into the extracellular space can occur by many' . 
mechanisms, including ex ocytoMs arid proteolytic cleavage. 1 
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< ' In specific embodiments, the polynucleotides of the invention arc less than 
300 kb, 200 kb. 100 kb/50 kb/15-kb, 10 'kb, or 7.5 kbin length. In a further 
embodiment, polynucleotides of the invention comprise at least 15 contiguous 
nucleotides of the coding sequence, but do not comprise all or a portion of any iniroa 
In another embodiment, the nucleic acid comprising the coding sequence dues not 
contain coding sequences of a genomic flanking gene (i.e., 5* or 3' to the gene in the 
genome). 

As used herein , a "polynucleotide" refers to^molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within ihe clone 
deposited with the ATCC. For example; the polynucleotide can contain the 
nucleotide sequence of the full length cDNA sequence, including the 5* and 3' 
/ untranslated sequences, the coding region, with or without the signal sequence, the 
secreted protein coding region, as well as fragments, epitopes, domains, and Variants 
of the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having the translated amino acid sequence generated from the 
polynucleotide as broadly defined- r 

In the present invention, the full length sequence identified as SEQ ID NO:X 
was often generated by overlapping sequences contained in multiple clones (contig 
analysis). A representative clone containing all or most of the sequence for SEQ ID : 
NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
shown in Table 1 . each clone is identified by' a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 
Manassas, Virginia 201 10-2209, USA. Hie ATCC deposit was made pursuant to the 
terms of the Budapest Treaty oh the international recognition of the deposit of 
microorganisms for purposes of patent procedure. ^ . V 

A "polynucleotide" of the present invention also includes those 
polynucleotides capable of hybridizing, under stringent hybridization conditions, to 
sequences contained in SEQ ID NO:X, the complement thereof, or the cDNA within 
the clone deposited with the ATCC. "Stringent hybridization conditions" refers to an 
overnight incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 
rnM NaCl, 75 rnM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's 
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solution, 10% dextran sulfate, and 20 u,g/ml denatured, sheared salmon sperm DNA, 
, followed by washing the filters in 0. lx SSC at about 65°C. . 

Also contemplated are nucleic acid molecules that hybridize to the > 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accompli shed through the manipulation of formamide concentration (lower 
percentages of formamide result in lowered stringency); salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M 
NaH,P0 4 ; 0.02M EDTA, pH 7.4). 0.5% SDS, 30% formamide. 1 00 ug/ml salmon- . 
sperm Mocking DNA; followed by washes at 50°C with 1XSSPE, 0.1% SDS. In 
addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5X SSC). 

Note that variations in thc above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available f proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to. problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
as any 3' terminal po!yA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide, would hybridize to any nucleic acid * 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically \j 
any double-stranded cDNA clone). - • 

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or * 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA 
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that may be single-stranded or, more typically, double -stranded or admixture of single- 
arid double-stranded regions. In addition, the polynucleotide can be composed of 
triple-stranded regions comprising RNA or DNA or both RNA and DNA. A 
polynucleotide may also contain one or more modified bases or DNA or RNA 
backbones modified for stability or for other reasons. "Modified" bases include, for 
example, triiylated bases and unusual bases such as inosine. A variety of 
modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metaboJically modified forms, 

The, polypeptide of the present invention can be composed of amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres. and may contain amino acids other than the 20 gene-encoded amino acids. •*'■ 
The polypeptides may be modified by either natural processes, such as 
posttranslational processing, or by chemical modification techniques which are well 
known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications 
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid 
side-chains and the amino or carboxyl termini.. It will be appreciated that the same 
type of modification may be present in the same or varying degrees at several sites in 
a given polypeptide, Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched , for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, 
and branched cyclic polypeptides may result from posttranslation natural processes or 
may be made by synthetic methods. Modifications include acetylation. acylation, 
ADP-ribosy lation, amidation, covalent, attachment of flavin, covalent attachment of a 
heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, 
cross-linking, eye lization, disulfide bond formation, demethylation, formation of 
covalent cross- links, formation of cysteine, formation of pyroglutamate, formylation, 
, gamma-carboxylation, glycosyiation, GPI anchor formation, hydroxy! ation, 
iodination, meihylation, myristoylation, oxidation, pegylation, proteolytic processing, 
phosphorylation, prcnylation, racemizatibn, sclenoylatioh. sulfation. transfer-RNA , 
mediated addition of amino acids to proteins such as arginyiation, and ubiquitination. 
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(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 
2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL CO VALENT MODIFICATION OF PROTEINS, B. C. 
Johnson. Ed., Academic Press. New York, pgs. 1-1-2 (1983): Seifter et al., Meth 
Enzymol 182:626-646 (1990); Rattan et aj.. Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X'\refers to a polynucleotide sequence while "SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table I 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the. 

. present invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. In the case where dose dependency docs 
exist, it need not be identical to that of the polypeptide; but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (i.e., the candidate polypeptide will exhibit greater activity or not 
more than about 25 -fold less and, preferably, hot more than about tenfold less 

' activity, and most preferably, not more than about three -fold less activity relative to 
the polypeptide of the present in ventioe.) 

Polynucle otides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

The translation product of this gene is a human glycoprotein-associated amino 
acid transporter (See, e.g. , Genbank Accession No. erhblCAA 10198.11 (AJ 1 307 18); 
all references available through this accession are hereby incorporated by reference 
herein). Amino acid transport across cellular membranes is mediated by multiple 
transporters with overlapping specificities. The transport system L, which mediates 
Na+-indcpcndent exchange of large neutral amino acids, consists of a novel amino 
acid permease-related protein (LAT1 or ArhAT-L-Ic) which for surface expression 
and function requires formation of disul fide-linked heterodimers with the 
glycosylated heavy chain of the h4F2/CD98 surface antigen, h4F2hc also associates 
with other mammalian light chains, e.u. y+LATl fronrmouse and human which are 
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approximately 48% identical with LAT 1 and thus belong to the same family of 
glycoprotein-associated amino acid transporters^ 

The hovel heterodimers form exchangers which mediate the cellular efflux of 
catiohicamino acids and the Na+-dependent uptake of large' neutral amino acids. 
Toese transport characteristics and kinetic and pharmacological fmgerprmts identify 
, them as y+L-typc transport systems. niRNA encoding my+LATI is detectable in most 
, adult tissues and expressed at high levels in kidney cortex and intestine. This indicates 
that the y+LAT 1 -4F2hc heterodimer, besides participating in amino acid 
uptake/secretion in many cell types, is the basolatcral amino acid exchanger involved 
in transepitheiial reabsorption of cationic amino acids; hence, its defect might be the 

* cause of the human genetic disease lysinuric protein intolerance. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
14. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 14. , 

Preferred polypeptides comprise the following amino acid sequence: 
LALYSALFS YSGWDTLN (SEQ ID NO: 237), VTEEIKNPERNLPL (SEQ ID NO: - 
238), IGISMPIVT (SEQ ID NO: 239), IYILTNVAYYTVL (SEQ ID NO: 240); SDA 
' VAVTFADQ (SEQ ID NO: 241), VALSCFGGLNASI (SEQ ID NO: 242)fSRLFFV 
GSREGHLPD (SEQ ID NO: 243), SFSYWFFVGl.S (SEQID NO: 244), VGQLYLR 
WKEP (SEQ ID NO: 245), RPRPLKLSVFFPIVFC (SEQ ID NO: 246), DTINSLIGI 
(SEQ ID NO: 247), LLAAACICLLTTmCAYVKWGTLVQDIFTYAKVLALIAVI 
VAGIVRLCQGASTHreNSFEGSSFAVGDlALALYSALFSYSGWDTLNYVTEEI 
KNPERNLPLSIGISMPr\n'IIYILTNVAYYTVLDMRDILASDAVA 
NWIIPLSVALSCFGGUVASIVAASRLrWGSREGHLPDAICMIHVERFTPVPSLL 

• ^GIMALTYLCVnDIFQLINYYSFSYWFr^G^ 
VF^PIVFCLCTIFL V A VPL YSDTlNSLlGIAIALSGLPFYrT-UR 

.« VGSATRYLQVLCMS VAAEMDLEDGGEMPKQRDPKSN (SEQ ID NO: 249) 
and/or ATALPPKI VGSATRYLQVLCMS VAAEMDLEDGGEMPKQRDPKSN 
(SEQ ID NO: 248). Polynucleotides encoding these polypeptides are also provided. 
. Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP-1 monocyte cells 
to calcium. Thus, it is likeiy that the product of this gene is involved in a signal 
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transduction pathway that is initiated when ihe product binds a receptor on the surface 
of the plasma membrane of both THP-1 monocytes, in addition to other cell-lines or 
tissue cell types. Thus, polynucleotides and polypeptides have uses which include, but 
are not limited to, activating monocytes. 

This gene is expressed primarily in endothelial cells and brain, and. to a lesser 
extent, in a wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the neural or gastrointcsinaj systems. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the circulation 
system or central nervous system, expression of this gene at significantly higher or • 
lower levels is routinely detected in certain tissues or cell types (e.g., neural, 
gastrointestinal, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative 10 the standard gene expression 
level, i.e.. the expression level in healthy tissue or bodily fluid from an individual not 
having' the disorder. t 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 124 as residues: Glu-102 to Asn-1 10, Arg-256 to 
Leu-266, Pro-3 16 to Trp-328, Pro-33 1 to Arg-336, Met-350 to GIy-358. 
Polynucleotides encoding said polypeptides are also provided. * ' 

The tissue distribution in brain combined with its homology to a amino acid 
transporter and biological activity of increasing ion flux in monocytes indicates 
' polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions. Representative uses are described in the 
"Regeneration" and "Hyperproiiferative Disorders" sections below, in. Example 11, 
15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
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. Huntington*s Disease, Toureue Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations ; spinal 
j 0 cord injuries, ischemia and infarc tion. aneurysms, hemorrhages, schizophrenia, 

mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
5 learning disabilities. ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
^ ■ * ■ expression of this gene product in regions of the brain indicates it plays a role in . 

normal neural function. 

Potentially, this gene product is involved in synapse forrnation, 

10 neurotransmission, leamine, cognition, homeostasis, or neuronal differentiation or . - 

20 , ■ * 

survival. Furthermore, the protein may also be used to determine biological activity, 
k ■ to raise an tibodies, as tissue markers, to isolate cognate ligands or receptors, to 

identify agents that modulate their interactions, in addition to its use as a nutritional 
25 supplement. Protein, as well as. anti bodies directed against the protein may, show 

15 utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 
, available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 11 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specific 
20 excluded from the scope of the present invention. To list every related sequence is 
35 cumbersome. Accordingly, preferably excluded from the present invention are one or 

more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1550 of SEQ ID NO: II, bis an - 
■ - integer of 15 to 1564, where both a and b correspond to the positions of nucleotide 
40 " 25 residues shown in SEQ ID NO:l 1, and where b is greater than or equal to a + 14. , - 

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

J 5 The g;ene encoding the disclosed cDNA is believed to reside on the X 

chromosome. Accordingly, polynucleotides related to this invention are useful as a 
'30 marker in linkage analysis for the X chromosome. 

4 Preferred polypeptides of the invention comprise the following amino acid 
50 sequence: 



30 



55 
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AAJ^GSGVRDPLEEAyCPFSDLQLIiAGRTTALFKAVRQGHLSLQR^ 
CPAPRGGAYRGRQASLSCGGLHPVRASRLLCLPKQAWAMAGAPPPVSLPPCS 
LISDCCASNQREis VG (SEQ ID NO: 250). Polynucleotides encoding these . 
polypeptides are also provided. 

This gene is expressed primarily in cord blood cells, and, lo a lesser extent, in 
frontal lobe of the brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases arid conditions which include, but are 
not limited to, developmental, reproductive, hematopoietic or neural disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and central nervous systems, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., immune, 
developmental, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
amniotic fluid, scrum, plasma, urine, synovial fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having.the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 125 as residues: His-56 to Gln-65, Leu-80 to Ile-85. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in cord blood cells indicates polynucleotides and - 
■ polypeptides corresponding to this gene are, useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
thrombocytopenia or leukemia since stromal cells are important in the production of 
cells of hematopoietic lineages. Representative uses are described in the "Immune 
Activity" and "infectious disease" sections below, in Example 11, 13. 14. 16. 18, 19, 
20. and 27, and 'elsewhere herein. Briefly, the uses include bone marrow cell ex-vivo 
culture, bone marrow transplantation, bone marrow reconstitution,' radiotherapy or 
chemotherapy of neoplasia. , 
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The gene product may also be involved in lymphopoiesis, therefore, it can be 
used in immune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of 
stenrcells and committed progenitors of various blood lineages, and in the 
differentiation and/or proliferation of various cell types. Alternatively, 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and' treatment of cancer and other proliferative disorders. Expression within 
.embryonic tissue and other cellular sources marked by proliferating cells indicates - • . ' 
that this protein may play a role in the regulation of cellular division. Similarly, 
embryonic development also involves decisions involving cell differentiation and/or 
apoptosis in pattern formation. Thus, this protein may also be involved in apoptosis or 
tissue differentiation and could again be useful in cancer therapy. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show'utility as a tumor marker and/or ? 
immunotherapy targets for the above listed tissues. , . 

Many polynucleotide sequences, such as EST sequences, are publicly , 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 12 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a js any integer between 1 to 1 743 of SEQ ID NO: 1 2, b is an 
integer of 15 to 1757, where both a and b correspond to the positions of nucleotide < 
residues shown in SEQ ID NO: 12. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

. This gene is expressed primarily in human T cell lymphomas. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell lype(s) present in a , 
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bioIogicaJ sample and for diagnosis of diseases and conditions which include, but arc 
not limited to, T cell lymphoma, immunodeficiencies; in addition to other immune 
system disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily fluids (e:g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. "■*•*'■' f 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 126 as residues: Met-i to Phe-10. Polynucleotides 
encoding said polypeptides are aJso provided. 

The tissue distribution in human T cell lymphomas indicates polynucleotides 
and polypeptides corresponding to this gene are useful for the diagnosis and treatment 
, of a variety of immune system disorders. Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 1 1, 13, 14, 
16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in. regulating the proliferation; survival; differentiation; and/or ' 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g., by boosting 
immune responses). * _ . 

Since the gene is expressed in cells of lymphoid origin, the natural gene . 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
Bowel disease, sepsis, acne, neutropenia, neuirophili a, psoriasis, hypersensitivities, 
such as T-ceil mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft rversus-tiost diseases, or autoimmunity 
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disorders, such asautoimmune infertility, lense tissue injury,' demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Furthermore, the protein may also be used to determine biological activity, 
raise antibodies, as tissue markers, to isolate cognate Hgands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 13 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is , 
cumbersome. Accordingly, preferably excluded from the presenL invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 1 359 of SEQ ID NO: 13, b is an 
integer of 15 to 1373, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 1 3, and where b is greater than or equai to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

The protein product of this clone shares sequence homology with the C-, 
tcminus of a human N-acetylglucosamine-phosphate mutase (See, e.g., Genbank 
Accession No. gbIA AC72409.il (AF 102265); all references available through this 
accession are hereby incorporated by reference herein.) Hofroann, et al. (Eur. J. 
Biochem. 221:741-747 (1994)) studied the N-acetylgiucosamine-phosphatc mutase of 
Saccharomyces cereyisiae and showed it to be essential for viability. A S. cerevisiae 
agml deletion mutant progressed through only approximately five cell cycles to form 
a 'string' of undivided cells with an abnormal cell morphology resembling 
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glucosamine auxotrophic mutants. Expression of the AGM 1 .gchc on a multi-copy 
.plasmtd led to a significantly increased N-acetylgiucosainine-phosphate mutase 
activity. Unlike over-expression or the S. cerevisiae AGM 1 gene in a ' ; 
phdsphoglu'comutase (pgrril delta/pgm2 delta) double deletion mutant which could 
restore phosphoglucumutase activity, over -expression of the PGM2 gene encoding the 
major isoenzyme of phosphoglucomutasc did not increase N-acetylglucosaihine- • 
phosphate- mutase activity and did not restore growth of agml deletion mutant cells. 
- These observations indicate that the different hexosepKosphate mutases of S. 
cerevisiae have partially overlapping substrate specificities but, nevertheless, distinct 
physiological functions. The human N-acetylglucosairune-phosphate -mutase is 
expected to share at least some biological activities with the Agm 1 protein. 

• Preferred polypeptide fragments of the invention comprise the following 
amino acid sequences: LSKAFLDSPNRLLAVENWTDHLRLTVPNGIGALKLRXM 
EHYFSC^LSVQLFNDGSKGKLNHLCGADFVKSHQKPPQGMEIKSNERCCSFD 
GDADRTvTYYHDADGHraLIDOT 

NGSSTRYLEEVMKVPVYCTKTGVKilLHHKAQEFDIGVYreANGHGTALFST 
AVEMKIKQSAEQI^DKKRKAAKMLENIIDLmQAAGDAlSDMLViEAILAL^ 
GLTVQQWDAL YTDLPNRQLKVQV ADRRV1STTX AERQA YTPPGLQEAINDL 
VKKYKLSRAFVRPSGTEDVVRVYAEAPSQESADHLAHEVSLAVFQLAGGIGE 
RPQPGF (SEQ ID NO: 25 0, LSKAFLDSPNRLLAVEMNTDHLRLTV (SEQ ID 
NO: 252), PNGIGALKLRXMEHYFSQGLSVQLFNDG (SEQ ID NO: 253), SKGKL 
NHLCGADFVKSHQKPPQGMEIKS (SEQ ID NO: 254), NERCCSFDGDADRIV 
YYYHDADGHFHLI (SEQ ID NO: 255), DGDKIATLISSFLKELLVEIGESLNIGV 
(SEQ ID NO: 256), VQTAYANGSSTRYLEEVMKVPVYCTKTG (SEQ ID NO: 
257), VKHLH HK AQEFDIG VYFEANGHGTALFS (SEQ ID NO:258), TAVEMK. 
IKQSAEQLEDKKRKAAKMLENI (SEQ ID NO: 259). IDLFNQAAGDAISDM 
LVIEAILALKGLT (SEQ ID NO: 260), VQQWDALYTDLPNRQLKVQVADRR 
VIST (SEQ ID NO: 261), TXAERQAVTPPGLQEAINDLVKKYKLSR (SEQ ID 
NO: 262), AFVRPSGTEDVVRVYAEADSQESA (SEQ ID NO: 263).'and/or DH 
LAHEVSLAVFQLAGGIGERPQPGF (SEQ ID NO: 264). Polynucleotides encoding 
these polypeptides arc also provided. 
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The gene encoding the disclosed cDNA is believed to reside on chromosome 
6.. Accordingly, polynucleotides related to this invention arc useful as a marker in 
linkage analysis for chromosome 6. 

This gene is expressed primarily in fetal brain, and, to a lesser extent, in a 
wide variety of human tissues. \ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
, reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but arc 
not limited to, developmental disorders, particularly of the central nervous system. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central and peripheral nervous system, expression of this gene at significantly higher 
or lower levels is routinely detected in certain tissues or cell types (e.g., neural, or 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. , 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 127 as residues: Asn-36 to Lys-42, Lys-53 to Gfn-60, 
IIe-64 to AJa-77, Aia-128 to Tyr-135. Lys-184 to Ala-199, Leu-245 to Leu-250. . 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution of N-acetyiglucosamire-phosphate mutase in fetal . 
brain indicates polynucleotides and polypeptides corresponding to this gene are useful 
for the detection, treatment, and/or prevention of neurodegenerative disease states, 
behavioral disorders, or inflammatory conditions. Representative uses are described in 
the ' Regeneration" and "Hypcrproliferativc Disorders" sections below, in Example 
11, 15, and 18, and elsewhere herein. Briefly, the uses include, but arc not limited to 
the detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's 
Disease, Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, 
demyehnating diseases, peripheral neuropathies, neoplasia, trauma, congenital 
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malformations, spinal cord injuries, ischemia and infarction, aneurysms, hemorrhages, 
schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder, depression, 
panic disorder, learning disabilities, ALS, psychoses, autism, and altered behaviors, 
including disorders in feeding, sleep patterns, balance, and perception. In addition, 
e^vatcd expression of this gene product in regions of the brain indicates it plays a ; 
role in normal neural function. 

Potentially, this gene product is involved in synapse formation, ; : 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to . 
identify agents that modulate their interactions, in addition to its use as a nutritional * 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 14 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically * 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present, invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general . 
formula of a^b, where a is any integer between 1 to 3726 of SEQ ID NO: 14, b is an 
integer of 1 5 to 3740, where both a and b correspond to the positions of nucleotide' 1 
residues shown in SEQ ID NO: 14, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

This gene is expressed primarily in human stomach and stomach tumor cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the gastrointestinal system, particularly cancer of ulcers of 
stomach tissue. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 



WO00/0W98 " _ ■ PCT/LS99/17130 

17 • 

tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the digestive system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., gastrointestinal, 
or cancerous and wounded tissues) or bodily fluids (e.g., bile, lymph, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., t 
* the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. . t 

1 The tissue distribution in tumors of the stomach indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis, treatment and 
intervention of these tumors, in addition to other tumors where expression has been 
indicated. Additionally, the protein-product of this gene may play a role in the normal, •/ 
function of the stomach and/or digestive system. Furthermore, the protein may also be ' 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as. EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 15 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically * 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 182 of SEQ ID NO: 15, h is an 
integer of 15 to 11 96, where both a and b correspond to jhe positions of nucleotide 
residues shown in SEQ ID NO: 15, and where b is greater than or equal to a +. 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

Preferred polypeptides of the invention comprise the following amino acid 
sequences: . 
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FE^PRESNipXIKLGTPTLLAKPCYIVISKRHlt 

NHFVIEIQKN I DCM S G PCPFGE VQLQPS TSLLPTLNRTFIWD V KAHK SIGLELQ 
FSIPRLRQIGPGESCPDGVTHSISGRIDATVVRIGTFCSNGTVSRIKM (SEQ ID 
NO: 266). and/or GTRA APGLG A WGRRSPPSFSPPRPRRPG VMaGLNC<3VSIAL 
LGVLLLGj^RLPRGAEAFEIALPRESNITVLIKLG 

I^lksGERIVFTFSCQSPEN'FfrVIEIQKMDCMSGPCPFGEVQLQPSTSLLPTLNR 
TFIWDVKAHKSIGLELQFSIPRLRQIGPGESCPDGVTHSISGRIDATVVRIGTFC 
SNGTVSRIKNfQFGVKMALHLPWraPRNVSGFSIANRSSIKRLCIIESVFEGEGS 
ATLMS AN VPEGFPEDELMTWQFVVP AHLRAS VS rT^NFN LSNCERKEER VE YY 
[PGSTTNPEVFKXEDKQPGNMAGNFNLSLQGCDQDAQSPGILRLQFQVLVQH 
PQNESNKIYVVDLSNERAMSLTIEPRPVKOSRKF^PGO^CLESRTCSSNLTLT 
SGSKHKISFLCDDLTRLWMNVEKP (SEQ ID NO: 265). Polynucleotides encoding 
these polypeptides are also provided. " v • 

This gene is expressed primarily in placenta, and to a lesser extent in, prostate , 
and ovary. ' : - .• , t • r . ■ 

Therefore, polynucleotides and polypeptides of the invention are useful as. 
reagents for differential identification of thetissue(s) or cell type(s) present in a * ' * 
biological; sample and for diagnosis of diseases and conditions which include, but are 
not limited to, male and female infertility, and associated disorders of the 
reproductive system. Similarly, polypeptides and antibodies directed to these i 
polypeptides are useful in providing immunological probes for differential r 
identification of the tissue(s) of cell type(s). For a number of disorders of the above- 
tissues or cells, particularly of the reproductive system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., reproductive, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, : 
amniotic fluid, seminal fluid, serum, plasma, urine, synovial fluid and spinal fluid) or • 
another tissue or cell sample taken from an individual having such a disorder, relative' 
to the standard gene expression level, i.e., the expression level in healthy tissue or " 
bodily fluid from an individual not having the disorder. • ' ~ * : ■ 

The tissue distribution of this gene in the prostate, placenta 1 and ovary 
indicates that polynucleotides and polypeptides corresponding to this gene are useful 
for treatment, prevention,' and/or diagnosis of male or female infertility v endocrine * 
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disorders, fetal deficiencies, ovarian failure, amenorrhea, ovarian cancer, benign 
prostate hyperplasia and prostate cancer. Similarly, the tissue distribution indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of cancer and other proliferative disorders. Expression within 
placental tissue and other cellular sources marked by proliferating cells indicates that 
this protein may play a role in the regulation of cellular division. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Furthermore, the protein 
may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
i available and accessible through sequence databases. Some of these sequences are 
; related to SEQ ID NO; 16 and may have been publicly available prior to conception of 
the present invention. Preferably^ such related polynucleotides are specifically 
excluded from the scope of the present invention. Xo list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2 195 of SEQ ID NO: 16, b is an 
integer of 1 5 to 2209. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO; 16, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

The translation product of this gene shares homology with the human and rat 
HNK-1 sulfptransfcrasc protein (See, e.g., Genbank Accession Nos. gblAAB88123.il 
(AF022729) and gil292l306lgblAAC04707.ll (AF033827)- all references available 
through these accessions are hereby incorporated herein by reference.) Ong E, et aI.,(J 
Biol Chem. 273(9):5190-5 (1998)) have characterized the human HNK-I 
sulfotransfcrase, and show that it is involved in the synthesis of , the HNK- 1 
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carbohydrate epitope which is expressed on various adhesion molecules in the 
nervous system and on immune cells (e.g., natural killer cells) and is suggested to 
play a role in cell-cell and cell-substratum interactions. Based on the sequence 
similarity, the translation product of this gene is expected to share at least some 
v biological activities with HNK-1 sulfotransferase proteins. Such activities arc known 
in the an, some of which are described elsewhere herein, or in, for example, Bakker, 
et al., J Biol Chem. 272:29942-6 (1997), incorporated herein by reference. Based on 
sequence si milarily between sulfotransferases, a consensus sequence for the active 
site was developed (Ong, et al., supra). The consensus pattern is as follows: 
xxRPDzzzz, where x represents hydrophobic amino acid residues and z represents 
any amino acid residue. Therefore. 

' , Preferred polypeptides of the invention comprise the following amino acid 
sequences: FVRDPFVRL (SEQ ID NO: 267), FLFVRDPFVRLIS (SEQ ID NO: 
268), FLFVRDPFVRLISAF (SEQ ID NO: 269). and/or YLHTSFS RPHTGPPLPTPG 
PDRDRELTADSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEENVRGY 
DWSPRDARi^SPDQGRQQAERRSVLRGFGANSSLAFPTKERAFDDIPNSELSHL 
IVDDRHGAlYCYVPKVACim^KRVMIVLSGSLLHRGAPYRDPLRIPREHVH 
NASAHLTWKFWRRYGlO^Rr^M^ 

FELENEEFYRKFA VPMLRL YAN HTSI ,PAS AREAFRAGLKVSFANFIQYLLDPH 
TEKLAPFNEHWRQVYRLCHPGQIDYDFVGKLETLDEDAAQLLQLLQVDRQ 
LRFPPSYRNRTASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLL 
RD (SEQ ID NO: 270). Polynucleotides encoding these polypeptides are also 
provided. 

Further preferred are the sulfotransferase active site polypeptides listed above, 
and at least 5, 10, 15, 20, 25 ; 30, 50, or 75 additional contiguous amino acid residues 
of the sequence referenced in Table I for this gene. The additional contiguous amino 
acid residues is N-terminal or C- terminal to the sulfotransferase active site 
polypeptides. Alternatively, the additional contiguous amino acid residues is botrrN- 
terminal and C-terminal to the sulfotransferase active site polypeptides, wherein they 
total N- and C-terminal contiguous amino acid residues equal the specified number. 
The above preferred polypeptide domains arc characteristic of a signature specific to 
sulfotransferase proteins. The nucleotides sequence of this gene was found to be 
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homologous to the human hypoxanthinc guanine phosphoribosyl transferase 2 cDNA 
which is know to be involved in the purine salvage pathway resulting in the ' 
maintai nance of homeostatic levels of uric acid (See Genbank Accession No.T30127). 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
7. Accordingly, polynucleotides related to this invention arc useful as a marker in ■ 
linkage analysis for chromosome 7. 

This gene is expressed to a very high level in I1L-60 myelogenous leukemia 
cell lines, and to a lesser extent, in most cell types of the immune system. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a * 
biological sample and for diagnosis of diseases and conditions which include, bur are 
not limited to, immune, myelopoiesis, and metabolic disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells; particularly of the immune and 
hematopoietic systems, expression of this gene at significantly higher or lower levels 
is routinely detected in certain tissues or cell types (e.g., immune, metabolic, or 
cancerous and wounded tissues) or bodily fluids (e:g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the • 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 130 as residues: Ser-39 to GIy-46, Leu-49 to Ala-62, 
Lys-79 to Ala-93, GJy-95 to Asp-105, Ser-107 to VaM27, Gly-193 to Leu-200, Lys- 
218 to Ser-227, Lys-234 to Thr-239. Pro-366 to Asp-379, Pro-406 to Asp-414. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in HL-60 myelogenous leukemia cell lines and 
homology to HNK-l sulfotransferase proteins indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis, prevention 
and/or treatment of a variety of immune system disorders, including but not limited 
to, those involving the HNK- 1 carbohydrate epitope, (e.g. HNK-I as an auto-antigen 
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in peripheral demyeli native neuropathy). Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11.13. 14, > 
16. 18, 19. 20, and 27, and elsewhere herein. Briefly, expression of this gene product 
in tonsils indicates a role in regulating the proliferation; survival; differentiation; 
and/or activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer. (e.g., by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene'* --"-j 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosus, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease; scleroderma and tissues. Moreover, the protein may represent a sectetcd 
factor that influences the differentiation or behavior of other hlpod cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the homology to a conserved purine metabolism protein may 
suggest that polynucleotides and polypeptides corresponding to this gene are useful 
for the diagnosis, prevention, and/or treatment of various metabolic disorders such as , 
Tay-Sach's Disease, phenylkenonuria, galactosemia, porphyrias; Hurler's syndrome, 
and various urogenital disorders related to metabolic conditions, particularly Lesch- 
Nyhan syndrome. Furthermore, the protein may also be used to detenmne biological 1 * 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, 
to identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 17 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically ... 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
. formula of a-b, where n is any 'integer between 1 to 1760 of SEQ ID NO: 17. b is an 
integer of 15 to 1 774; where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 1 7, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

. When.tested against Jurkat T-cell lines, supernatants removed from cells 
containing this gene activated the gamma activating sequence (GAS) promoter 
element. GAS is a promoter element found upstream of many genes which are 
involved in the Jak-STAT pathway, a large, signal transduction pathway involved in 
the differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. Thus, it is likely that 
this gene activates T-cells through the Jak-STAT signal transduction pathway; 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: KLVRLQVPVRNSRVDPRVRSKIGSRRWMLQLI 
MQLGSVLLTRCPFWGCFSQLMLYAERAEARRKPDIPVPYLYFDMGAAVLCA 
SFMSFGVKRRWFALUAALQLAISTYAAYIGGYVHYGDWLKVRMYSRTVMI 
GGFLVLASGAGEL^RKPRSRSLQSTGQVFLGIYLICVAYSLQHSKEDRLA 
YU^HLPGGELMIQLFr^LYGILALAn.SGYYVTLAAQn-AVLLPPVMIXIIX^ 
NVAYWHNTRRVEFWNQMKLLGESVGIFGf AVILATDG (SEQ ID NO: 271), f 

A preferred polypeptide fragment of the invention comprises the following, 
amino acid sequence: MQLGSVLLTRCPFWGCFSQLMLYAERAEARRKPDIPVP 
YLYFDMG AAVLCASFMSFGVKRRWFALGAALQLAISTYA AYIGGYVHYGD 
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.WLKVRMYSRTVAUGGFLVLASGAGELYRRKPRSRSLQSTGQVFLGIYLICVA 
YSLQHSKEDRLAYLNHLPGGELN^QLFF^YGILAPGLSVRLLRDPRCPDPGC 
TAAPCHAAH (SEQ ID NO: 272). Polynucleotides encoding these polypeptides are 
also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
17., Accordingly, polynucleotides related to this invention are useful as'a marker in 
linkage analysis for chromosome 17. ' ' 

' This gene is expressed primarily in endometrial tumors, and to a lesser extent, 
in T-ceils, pituitary arid to a certain extent in most cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, female reproductive, immune, or endocrine disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive 
and/or immune systems expression of this gene at significandy higher or lower levels 
is routinely detected in certain tissues or cell types (e.g.. immune, reproductive, or 
cancerous and wounded tissues) or bodily fluids (e.g.. amniotic fluid, lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not • 
having the disorder . 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 131 as residues: AIa-27 to Asp-34, Tyr-1 16 to Leu- 
125. Polynucleotides encoding said polypeptides are also provided. * 

, The tissue distribution predominancy in the endometrium indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, and/or prevention of a range of immune and/or reproductive 
disorders including endometriosis, endometritis, and endqmetrioma. Similarly, the 
tissue distribution in T-cells and the ability of supematants expressing this gene to* * 
stimulate the GAS promoter element in T-cells indicates polynucleotides and - 
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polypeptides corresponding to this geneare useful for the diagnosis and treatment of a 
variety of immune system disorders. Representative uses are described in the ' 
"Immune Activity" and "infectious disease" sections below, in Example 11, 13; 14, 
16, 18; 19, 20. and 27. and elsewhere herein. Briefly, the expression of this gene ' 
product indicates a role in regulating the proliferation; survival: differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the s treatment of cancer (e.g., by boosting 
immune responses). . 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-celi mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- vcrsus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, dernyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells,' or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may, have 
commercial utilityin the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell, 
types. Alternatively, the tissue distribution in pituitary indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of various endocrine disorders and cancers. Representative uses are \- ' 
described in the "Biological Activity", "Hypcrproliferative Disorders \ and "Binding 
Activity" sections be low, in Example 11, 17, 18, 19, 20. and 27, and elsewhere herein.* 
Briefly, the protein can be used for the detection, treatment, and/or. prevention of the 
Addison's Disease, Cush trig's. Syndrome, and disorders and/or cancers of the pancreas 
(e.g., diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, . • ■ > 
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hypopituitarism), thyroid (e.g., hyper-, hypothyroidism), parathyroid (e.g., hypcr- 

.hypoparathyroidism), hypothalamus, and testes. 1 . 

Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 18 and may have been publicly available prior to conception of 

the present invention. Preferably, such related polynucleotides are speciflcaJJy 

excluded from the scope of the present invention. To list every related sequence is 

cumbersome. Accordingly, preferably excluded from the present invention are one or 

more polynucleotides comprising a nucleotide sequence described by the general 

formulaof a-b, where a is any integer between 1 to 1660 of SEQ ID NO: 18, b is an 

integer of 15 to 1 674, where both a and b correspond to the positions of nucleotide ' 

residues shown in SEQ ID NO: 1 8, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of the myeloid leukemia 
cell, line AML- 1 93 to calcium. Thus, - it is likely that the product of this gene is 
involved in a signal transduction pathway that is initiated when the product binds a 
receptor on the surface of the plasma membrane of myeloid leukemia cells, in 
addition to other celJ-Iines or tissue cell types. Thus, polynucleotides and polypeptides 
have uses which include, but are not limited to, activating myeloid leukemia cells. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence; SNEILLSFPQNYYIQWLNGSLIHGLWNLASLFS 
NLCLFVLMPFAFH^ESbGFAGLKK^ 
ALIDrTOAASMESLYDLWEFYLPYLYSCISLMGCli^ 
HLLVKPTILEDLDEQIYIITLEE 

KTKLERRKKASAWERNLVYPAVMVLLL1ETSISVLLVACNILCLLVDETAM 
PKGTRGPGIGNASLSTFGFVGAALEIIUFYLM 

DTTMTKIIGNCVSILVLSSALPVMSRTLGITRFDLLGDFGRFNWLGNFYIVLS 
YNLLFAI VTTLCL V R KFTS A VREELFKALGLHKLHLPNTS RDS ETA KPS VNGH 
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QKALCSEQ ID NO: 273). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in fetal heart, and to a lesser extent, in colon 
' and the adult pulmonary system. ■ ' 

\ 5 Therefore, polynucleotides and polypeptides of the invention are useful as 

\reagents for differential identification of the tissuc(s) or cell type(s) present in a 
'5 f * biological sample and for diagnosis of diseases and conditions which include, but are 

not limited to, heart, Jung and digestive disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immurioiogical 
probes for differential identification of the tissue(s) or cell type(s). For a number of ' 
disorders of the above, tissues or cells, particularly of the cardiovascular, pulmonary 
and digestive systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., developmental, cardiovascular, 
25 r : or cancerous and wounded tissues) or bodily fluids (e.g.. lymph, amniotic fluid, 

"■. '■15 serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder/relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. v t 

Preferred polypeptides of the present invention comprise immunogenic 
20 epitopes shown in SEQ ID NO: 132 as residues: GIu-67 to Asn-74, GIu-88 to Asn-93, 
Lys-95 to Ala-107, Ala-147 to Arg- 153, Phe-197 to Thr- 205, Pro- 292 to His-308 
Polynucleotides encoding said polypeptides are also provided ' 

The tissue distribution in fetal heart, colon and pulmonary tissues and the 
biological activity in increasing the permeability of the plasma membrane of the 
myeloid leukemia cell line AML-193 to calcium, likely indicating that the product of 
this gene is involved in a signal transduction pathway that is initiated when the 
product binds a receptor on the surface of the plasma membrane of myeloid leukernia 
cells, indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment, prevention, and/or detection of a range of disorders including 
30 a variety of vascular disorders and conditions, which include, but are not limited to 
r : . miscrovascular disease, vascular .leak syndrome, aneurysm, stroke! embolism, 
$0 myocardial infarction, myocarditis, ischemia, thrombosis, coronary anery disease, 
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arteriosclerosis, and/or atherosclerosis; pulmonary edema and embolism, bronchitis 
and/or cystic fibrosis; Crohn's Disease and/or colon cancer. Similarly, the tissue 
distribution indicates that polynucleotides and polypeptides corresponding to this 
, gene are useful for the diagnosis and treatment of cancer and other proliferative 
disorders. Expression within embryonic tissue and other cellular sources marked by 
proliferating cells indicates that this protein may play a role in the regulation of 
cellular division. Similarly, embryonic development also involves decisions involving 
cell differentiation and/or apoptosis in pattern formation. Thus this protein may also 
be involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Furthermore, the protein niay also be used to determine biological activity, to 
raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker, and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences; are publicly 
available and accessible through sequence databases: Some of these sequences are 
related to SEQ ID NO: 19 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically - 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formuiaofa-b, where a is any integer between 1 to 2004 of SEQ ID NO: 19, b, is an 
integer of 15 to 2018, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 1 9, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

The protein product of this clone shares sequence homology with the human 
MaxiK channel beta 2 subunit (See, Genbank Accession No. , ; . 
gWAAD23380.IIAF099l37_l (AF099137); all references available through this 
accession are hereby incorporated herein by reference), which is believed to be a . 
modulatory subunit of the voltage and Ca2+ activated K+ (MaxiK) channel. 
Additionally, this protein snares homology to the human calcium-activated potassium 
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channel beta subunit, which, when combined with its corresponding alpha subunit and 
modulating peptide, are believed to be useful in treating asthma, angina, hypertension, 
incontinence, migraine, irritable bowel syndrome (IBS). The subsequent 
heteromultimer that forms upon combining the alpha, beta, and modulator subunits 
are also thought to be useful in preventing premature labour, preventing and treating 
cerebral ischemia, inducing pain modulation and decreasing neurogenic inflammation 
in a patient (See GeneSeq Accession No. R85306). 

Preferred polypeptides comprise the soluble domain which consists of the 
following amino acid sequence: RSYMQSVWTEESQCTLLNASITETFNCSFSCGP 
DCWKLSQYPCLQVYVNLTSSGEKiLLYHTEETIiaNQKCSyiPKCGKNFEESM 
SLVNVVMENFRKYQHFSCYSDPEGNQKS^ 

GG VAIVAMVKLTQYLSLLCERIQRINR (SEQ ID NO: 274), Polynucleoudes 
encoding these polypeptides are also provided^Based on the sequence similarity, the 
translation product of this gene is expected to share at least some biological activities 
with modulatory subunits of voltage and Ca2+ activated K+ channel proteins. Such 
activities are known in the an, some of which are described in Wallner. el a]., PNAS 
96:4137-4142 (1999), incorporated herein by reference. 

This gene is expressed primarily in adrenal gland tumor and to a lesser extent, 
in Hodgkin's lymphoma. * . 

Therefore, polynucleoudes and polypeptides, of the invention arc useful as 
reagents for differential identification of the ussue(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, endocrine and immune disorders, particularly Hodgkin's Lymphoma. - 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissuc(s) or cell 
type(s)'; For a number of disorders of the above tissues or ceils, particularly of the 
immune and/or endocrine systems, expression of this gene at significantly higher or 
lowerlevels is routinely detected in certain tissues or cell types (e.g., immune, 
endocrine, or cancerous and wounded tissues) or bodily fluids (e.g.; lymph, scrum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
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level, i.e.. the expression level in healthy tissue or bodily fluid from an individual hot 
■ having the disorder, 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 133 as residues: Trp-25 toGln-30, Pro-50 toGln-57, 
Pro-93 to GIu-101, Arg-l 14 to Cys-121, Scr-123 to Gin- 129, lie- 177 to Arg-182.' 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in adrenal gland tumor and it's identification as the • 
modulatory subunit of the voltage and Ca2+ activated K+ (NiaxiK) Channel indicates " 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
detection* treatment, and/or prevention of various endocrine disorders and cancers, 
particularly Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of 
the pancreas (e.g., diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g., hyper-, hypothyroidism), parathyroid (e.g.; hyper- > 
hypoparathyroidism), hypothalamus, and testes. Alternatively, expression in 
proliferative immune tissues combined with its homology to a novel human K 
channel indicates that polynucleotides and polypeptides corresponding to this gene, 
are useful for the diagnosis and treatment of a variety of immune system disorders. 
Representative uses are described in the "Immune Activity" and "infectious disease" 
sections below, in Example 11,13, 14 T 16, 18, 19, 20, and 27, and elsewhere herein. ; 
Briefly, the expression of this gene product in Hodgkin's lymphoma indicates a role in : 
regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product is involved 
in the regulation of cytokine production, antigen presentation, or other processes . 
suggesting a usefulness in the treatment of cancer (e.g., by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin- the natural gene * 
product is involved in immune functions. Therefore it js also used as an agent for 
immunological disorders including arthritis, asdima, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis ; acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; inunune reactions to transplanted organs and 
tissues, such as host- vers us -graft and graft-versus- host diseases, or autoimmunity 
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disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythernatosis. drug induced hemolytic anemia, rheumatoid arthritis. Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell . 
types. Furthermore, the protein may also be used to determine biological activity, 
raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed againsr the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
. related to SHQ ID N0:20 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b ; where a is any integer between 1 to 2084 of SEQ ID NO:20, b is an 
integer of J 5 to 2098, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:20, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

The translation product of this gene shares homology with collagen and 

i 

collagen like proteins {See. e.g., Oenbank Accession Nos. 
gil2920535igblAAC39658.11 (AF01808I) and gil23849421gblAAB6996l.ll 
(AF022985); all references available through these accession numbers are hereby 
incorporated by reference herein). Additionally, it has been determined that this gene 
has homology to the human Kruppel related zinc finger protein (HTF10) which is 
known to be important as a transcription factor, particularly in development (See 
Genebank Accession No.Ll 1672). 
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In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the in venuori comprise the 
■ following amino acid sequence: AFAHLQLGPMWKLWRAEEGAAALGGALFLLL 
HALGVRQLLKQRRPMGFPPGPPGLPHGNIYSLAASSELPHVYMRKQSQVYG 
EVQPRI^PGREGRQAGPGWPGPSWIJ}LWPPL^ 

LEGRS PRRRAPLQGEPRPCR (SEQ ID NO: 275). PolynucJeotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in human er>'throleukemia cell line (HEL), 
serum induced smooth muscle, and to a lesser extent in human 8 week whole embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include^ but are' 
not Limited to, leukemia, musculoskeletal, or developmental disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are, useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the hematopoietic 
system and muscular system, expression of this gene at significantly higher or lower 
levels is routinely detected in certain tissues or cell types (e.g., immune, 
musculoskeletal, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
amniotic fluid, serum, plasma, urine, synovial fluid and spinal fluid) of another tissue 
or cell sample taken from. an individual having such a disorder, relative to the 
standard gene. expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 134 as residues: Leu-30 to Giy-38. Arg-67 to Val-72, 
Val-76 to Ala-89, Pro-1 18 to Arg-123, Gly-129 to Ala-136, Leu-138 to Arg-146. 
Polynucleotides encoding said polypeptides arc also provided. 

" The tissue distribution in human erythroleukemia ceil line (HEL), and serum 
induced smooth muscle, and the shared homology with collagen and collagen like H 
proteins indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for disorders of hematopoietic or muscular systems, such as leukemia and 
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muscular dystrophy. Additionally, the shared homology with collagen proteins would 
suggest that this protein may also be important in the diagnosis or treatment of 
various autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, 
derrnatomyosttis, etc.), dwarfism, spinal deformation, joint abnormalities, and 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, fumilial 
osteoarthritis, Atelosteogcnesis type II, metaphyseal chondrodysplasia type Schmid, 
etc.).. • ' ' . 

The secreted protein can also he used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identity agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in * 
4he specification. Typical of these are cytoJdne, cell proliferation/differentiation 
modulating activity or induction of other cytokines; 
immunostimulating/inimunosuppressant activities (e.g., for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation. of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g., for treating 
infections, jumours); haemostatic or thrombolytic activity (e.g., for treating 
haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g., for treating '. 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproUferative disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 1 
procedures. Furthermore, the protein may also be used to determine biological 
'activity, to raise antibodies, as tissue markers; to isolate cognate ligands or receptors, 
to identify agents that modulate their interactions, in addition to its use as a nutritional 
^supplement Protein, as well as, antibodies directed against the protein may show 
v—*^ a lumor ma rker and/or immunotherapy targets for the above listed tissues. ** 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:2 1 and may have been publicly available prior to conception of 
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, the present invention. Preferably, such related polynucleotides are 'specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-K where a is any integer between 1 to 1732 of SEQ ID NO: 21, b is an 
integer of 15 to 1746, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:2I, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 , 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MRVRIGLTLLLCAVLLSLASASSDEEGSQD 
ESLGFQDY FDIR (SEQ ID NO: 276). Polynucleotides encoding these polypeptides 
are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
8. Accordingly, polynucleotides related to this invention are useful as a marker in * 
linkage analysis for chromosome 8. v. 

This gene is expressed primarily in dendritic cells. 1 ' * ' 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell rype(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above/ 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., imrhune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene V 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 135 as residues: Ser-22 to Ser-4l. GIu-43 to Thr-50. 
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Ser-63 to -Leu-68. Ser-7 1 to Gly-84, Ser-96 to Gly- 1 14. Polynucleotides encoding said 
polypeptides are also provided. • ■ „ 

The tissue distribution in dendritic cells indicates that polynucleotides and - 
polypeptides corresponding to this gene are useful for the diagnosis; prevention, 
and/or treatment of a variety of immune system disorders. Representative uses are' , 
describecl in the "Immune Activity" and "infectious disease " sections below, in 
Example M, 13, 14, 16. 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the . 
. treatment of cancer (e.g., by boosting immune responses). , 

Since the. gene is expressed in cells of lymphoid origin, the natural gene ' , 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granuiornatou's Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis. drug induced hemolytic, anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells!" or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. , > , . 

The secreted protein can be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
the 1 specification. Typical of these arc cytokine, cell proliferation/differentiation ' : 
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modulating activity or induction of other cytokines; 

immunostimulating/immunosupprcssant activities (e.g., tor treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haemaropoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or, 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g., for treating 
infections, tumours); haemosraric or thrombolytic activity (e.g. , for treating 
haemophilia, cardiac infarction etc.); anti -inflammatory activity (e.g., for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferative disease: for regulation of metabolism, behaviour, and many others! 
Also contemplated is the use of the corresponding nucleic acid in gene therapy / 
procedures. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate Hgands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues - , 

Many .polynucleotide sequences, such as EST sequences, are publicly 1 
available and accessible through sequence databases. Some of these sequences are ; . 
related to SEQ ID NO: 22 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every.relatcd sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising' a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2862 of SEQ ID NO:22, b is an 
integer of 15 to 2876, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:22, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

A preferred polypeptide variant of the invention comprises the following 
amino acid sequence: MARG S LRRLLRLL VLGL WL ALLRS V AGEQ A PGT APC 
SRGSSWSADLDKCMDCSTSCPLPA ALAHPWGRSEPDLRAGAAFWLFGLE • 
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TMPQE REVHHPHRGDRRRGLPSCGADPVTMCPLPAGARPLnHSSILEPVSAS 
QTRREPSSSNHK GGGGR (SEQ ID NO: 277). Polynucleotides encoding these 
polypeptides are also provided. 

The gene encoding the disclosed cDN A is believed to reside on chromosome ' " 
1 6. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 16. . 

this gene is expressed primarily in tumor growth factor or lipopoly saccharide 
treated bone marrow stroma, epithelioid sarcoma, umbilical vein endothelial cells, and 
to a lesser extent, in other tissues. ^ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
' reagents for differential identification of the tissue(s) or cell type(s) present* in a* 
biological sample and for diagnosis of diseases and conditions which include, but arc 
not limited to, hematopoiesis or immune disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the hematopoietic, 
integumentary, or immune systems expression of this gene at significantly higher or • 
lower levels is routinely detected in certain tissues or cell types (e.g., hematopoietic, 
immune, or cancerous and. wounded tissues) or bodily fluids (e.g., lymph, serum, • 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e.. the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 136 as residues; Pio-35 lo Trp-42, Pro-65 to Asp-72, 
Thr-86 to Phe-93, He-97 to Glu-103. Polynucleotides encoding said polypeptides are 
. also provided. 

The tissue distribution in tumor growth factor or lipbpoJysaccharide treated 
. bone marrow stroma, epithelioid sarcoma, and umbilical vein endothelial cells , 
indicates that polynucleotides and polypeptides corresponding to this gene arc useful 
for the diagnosis, prevention, and/or treatment of a' variety of immune system . • ' 
disorders. Representative uses are described in. the "Immune Activity" and "infectious 
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disease" sections below, in Example U, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
. herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation: survival; differentiation: and/or activation of hematopoietic cell 
lineages; including blood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
-in the treatment of cancer (e.g M by boosting imniune responses). " + 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 11 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-celi mediated cytotoxicity; immune reactions to transplanted organs and ' 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity , 
disorders, such as autoimmune infertility, Ichsc tissue injury, demyelination. systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that ^ 
recruits hematopoietic cells to sites of injury. In addition ^ this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
. types. • ; ' , ;. 

The secreted protein can be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may' also have a . . * 
very wide range of biological activities although no evidence for any is provided in 
the specification. Typical of these are cytokine, cell proliferation/differentiation 
modulating activity or induction of other cytokines; 
immunostimulating/irnrnunosuppressant activities (e.g., for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
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control of fertility); chemocactic and cherrlokinetic activities (e.g., for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g., for treating 
haernophiiia. cardiac infarction etc.); antiinflammatory activity (e.g., for treating 
septic shock, Crohn's Disease ): as antimicrobials; for treating psoriasis or other 
hyperproliferative disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 23 and may have been publicly available prior to conception' of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a- b, where a is any integer between 1 to 1038 of SEQ ID NO:23 T b is an 
integer of 1 5 to 1052, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:23, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

The translation product of this gene shares sequence homology with 
chromaffin granule amine transporter protein which is thought to be important in ,. 
vesicle membrane amine transport, particularly in the neural and endocrine tissue, and 
the human vesicular monoamine transporter hVMATl which is involved in the 
regulation of amine storage in cardiovascular, endocrine, and central nervous system 
function (See, Genbank Accession Nos. gi!1314290 and gblAAC50472.ll; all 
references available through these accession numbers are hereby incorporated by 
reference herein). Based on these sequence similarities. The translation product of 
this gene is expected to share at least some biological activities with amine 5 transporter 
proteins. Such activities are known in the art, some of which are described in 
Erickson, et al., PNAS 93:5 166-5171 (1996), and/or Liu, et al., Ceil 70:539-551 
(1992), which are both incorporated herein by reference. 
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In a specific embodiment, polypeptides comprising the amino, acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GTSRDPTLSLFVLEfCFNLPAGYVGLVFLGMAL 
SYAISSPLFGLLSDKRPPLRKWLLVFGNLITAGCYML 

VLILVVSGl^AGMSIIPTFPEO.SCAHENGraGLSTLGLVSGLFSAMWSIGAF 
MGPTLGGFLYEKIGFEWAAAIQGLWALISGLAMGLFYLLEYSF^KRSKSQ^^ 
STFJRERTTLLPNET (SEQ ID NO: 278). Polynucleotides encoding these 
polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
6. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 6. . , 

This gene is expressed primarily in colon cancer, osteoclastoma, and T-cell , . 
lymphoma, and to a lesser extent in many tumor or proliferative tissues such as 
endometrial tumor, chondrosarcoma, induced umbilical vein endotheliaTcells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. diseases resulting from disorders in small molecule transport (i.e., 
signalling molecules) in afflicted tissues and organs, particularly of the endocrine and 
central nervous systems. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential . 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the musculoskeletal, immune, and/or digestive systems 
and cancer expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or ceil types (e.g., neural, endocrine, or cancerous and 
wounded tissues) or bodily fluids (e.g., bile, lymph, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 137 as residues: Ser-1 14 to Asn-123, Thr-127 to Thr- 
132. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in colpn cancer, osteoclastoma, and T-cell lymphoma 
and homology to amine transporter family members indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
disorders or diseases resulted from smaJl molecule transport in afflicted tissues and . . 
organs, particularly that of colon, osteoclast or t-cells. The expression in cancer 
tissues, and shared homology with transporter proteins may also indicate its role in 
anti-cancer drug resistance. Additionaly, the protein can be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands, to 
. identify agents that modulate their interactions and as nutritional supplements. It may 
also have a very wide range of biological activities although no evidence for any is 
prov ided in the speci fication. Typical of these are cytokine, cell * 
proliferation/differentiation modulating activity or induction of other cytokines; 
imjnuhostimulating/immunosupprcssant activities (e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); . 
regulation of haematopoicsis (e.g. for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g. for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g. for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g. for treating , ' 
haemophilia, cardiac infarction etc. ); anti-inflammatory activity (e.g. for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperprolifcrative disease; or for identifying inhibitors or promoters of the transport 
of toxic molecules to vesicles, for regulation of metabolism, behaviour, and many 
others. Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:24 and may have been publicly available prior to conception of 
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the present invention: Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 1527 of SEQ ID NO: 24, b is an 
integer of 1 5 to 1 54 1 , where both a and b correspond to the positions of nucleotide 
' residues shown in SEQ ID NO:24, and where bis greater than or equal to a + 14. - . 

FEATURES OF PROTEIN ENCODED BY GENE NO; 15 

The translation product of this gene shares sequence homology with the 
human prolyl 4-hydroxyIase alpha (II) subunit which is important in catalyzing the 
formation of'4-hydroxyproline in collagens which is essential for the folding of newly 
synthesised collagen polypeptide chains into triple-helical molecules (See Genbank 
Accession No. gbtAAB7 1 339. 1 1; all references available through this accession are 
hereby incorporated herein by reference). Based on the sequence similarity, the 
translation product of this gene is expected to snare at least some biological activities 
with Prolyl 4-hydroxyIase proteins. Such activities are known in the art', some of 
which are. described in Annunen, et ah, J. Biol. Chem. 272: 17342.17348 (1997) 
which is incorporated herein by reference. 

When tested against U937 myeloid and Jurkat T-cclI cell lines, supernatants 
removed from cells containing this. gene activated the gamma activating sequence' 
(GAS), a promoter element found upstream of many genes which are involved in the 
Jak-STAT pathway. The Jak-STAT pathway is a large, signal transduction pathway 
involved in the differentiation and proliferation of cells. Therefore, activarion of the 
Jak-STAT pathway, reflected by the binding of the GAS element, can be used to 
indicate proteins involved in the proliferation and differentiation of cells. Thus, it is 
likely that this gene activates myeloid cells and T-cells through the Jak-STAT signal 
transduction pathway. 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: 
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GfREARiJIDLTRFYDKVLSLHEDSTTPVANPLLAFTLiKRLQSDWRNVVHSL 
EASENIRALKDGYEKVEQDLPAFEDLEGAARALMRLQDVYMLNVKGLAR . 
G WQRVTGS AITDLYS PKRLFSLTGDDCFQVGKVA YDMGD Y YHAIPWLEEA 
VSLFRGSYGEWKTEDEASLEDALDHLAFAYFRAGWSCALSLSREFLLYSPD 
NKRMARN\aKYERLLAESPNHVVAEAVTQRPNIPHLQTRDTYEGLCQTL 
GSQPTLYQ1PSLYCS YETNSNA YLLLQPIRKEVIHLEPYIALYHDFVSDSEAQ 
lORELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTVDLKLVfLNHRIAA 
LTGLDVRPPYAEYT QVWYCHGGH^ 

IYLSSVEACH3ATAFIYANLSVPVVRNAALFWNLHRSGEGDSDTLHAGCT 
VLVGDKWVANKWIHEYGQEFRRPCSSSPED (SEQ ID NO: 282). Additional, 

Preferred polypeptides comprise the following amino acid sequence: GTREA 
RLRDLTRFYDK^LSLHEDSTTPVANPLLAF^ 

RALKDGYEKVEQ DLPAFEDLEGAARAL (SEQ ID NO: 279), ALMRLQD (SEQ 
ID NO: 280), and/or VEAGGAT (SEQ ID NO: 281). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in lymph node breast cancer, colon 
carcinoma, and to a lesser extent in osteoblasts and adipocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are ■■ » 
not limited to, disorders of connective and immune tissues, particularly autoimmune ^ 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are * 
useful in providing immunological probes for differential identification of the 
tissue (s) or cell type(s)^ For a number of disorders of the above tissues or cells, 
particularly of the connective tissues in breast, colon, bone, and fat, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., immune, connective, or cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder: " 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 138 as residues: Ser-74 to Ala-84, GIn-156 to Tyr- 
161, Tyr-184 to Asn-189, Ser-218 to De-223, Pro-299 to Ser-308, His-359 to Thr-368, 
Tyr-390 to Asp-404. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in lymph node breast cancer and colon carcinoma and 
homology to prolyl 4- hydroxylase alpha (II) subunit indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for intervention of connective 
tissue disorders and diseases (e:g. arthritis, trauma, tendonitis, chrondomalacia and 
inflammation), as, well as, in the diagnosis or treatment of various autoimmune 
disorders such as rheumatoid arthritis, lupus, scleroderma, and dennatomyositis as 
well as dwarfism, spinal deformation, andspecific joint abnormalities as well as 
chondrodysplasias ie. spondyloepiphyseal dysplasia congenita, familial osteoarthritis, * 
Atelosteogenesis type H, metaphyseal chondrodysplasia type Schmid. Alternatively,, 
the tissue distribution within various tissue carcinomas and tumor tissues, and 
biological activity reflected by the binding and activation of the GAS promoter 
element indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and treatment of cancer and other proliferative disorders. 

Expression in cellular sources marked by proliferating cells indicates that this 
protein may play a role in the regulation of cellular division. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus this protein may also be involved in apoptosis or tissue , 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
antibodies directed against the protein may show utility as, a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly, 
available and accessible through sequence databases. Some of these sequences are 
■ related to SEQ ID NO:25 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably' excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2065 of SEQ ID NO: 25. b is an 
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integer of 15 to 2079, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:25, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

In an additional embodiment, polypeptides comprising the amino acid 
sequence of the open reading frame upstream of the predicted signal peptide are 
contemplated by the present invention. Specifically, polypeptides of the invention 
comprise the following amino acid sequence: IQPSHAALLHCRSTFRKTECLDPW 
WVRRQLLGMAGIGGLQKMKAPHTGVLHLGSVW 

SGCMSTVRWLNSNrTANRTLSRSVCHVTPLrmSI^PHDGEYLRQMLLNSSSR 
AGEAGSWGY (SEQ ID NO: 283). Polynucleotides encoding these polypeptides are 
■ also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
20. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 20. 

This gene is expressed primarily in fetal liver, and, to a lesser extent, in a 
variety of fetal and other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to r liver disorders and cancers (e.g., hepatoblastoma, hepatitis, liver 
metabolic diseases and conditions that are attributable to the differentiation of ' 
hepatocyte progenitor cells). Similarly, polypeptides and antibodies directed to these 

F ' . 

polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the liver, expression of this gene at significantly higher 
or lower levels is routinely detected in certain tissues or ceil types (e.g., hepatic, or 
cancerous and wounded tissues) or bodily, fluids (e.g., lymph, bile, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to' the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 1 39 as residues: Ser-67 to Tyr-75. Polynucleotides 
encoding said polypeptides are also provided. : 

The tissue distribution in fetal liver indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for detection and treatment of liver 
disorders and cancers (e.g., hepatoblastoma, jaundice, hepatitis, liver metabolic > 
diseases and conditions that are attributable to the differentiation of hepatocyte 
progenitor cells). In addition the expression in fetus would suggest a useful role for 
the protein product in developmental abnormalities, fetal deficiencies, pre-natal 
disorders and various wou Id-healing models and/or tissue trauma. Furthermore, the ' 
protein may also be used to determine biological activity, to raise antibodies,* as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
1 interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are ■ 
related to SEQ ID NO:26 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 1933 of SEQ ID NO:26, b is an 
integer of 1 5 to 1947, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 26, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

The translation product of this gene shares sequence homology with human' 
laminin B 1 which is thought to be an important structural extracellular matrix 
component involved in cell migration and signalling, paricularly in stimulating ■ 
epithelial cell growth and differentiation (See, Gcnbank Accession No gill 86837). 
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Preferred polypeptides of the invention comprise the following amino acid 
sequences: CSS PPG R LPWC W"f A PRTLG KHG S LI ST L R LTA PLH LA WKM M LS 
RKALFVLLNTPV LFHALEGRLFSKLCHHHTIQRTLTV PKFRSS (SEQ ID NO: 
284), RSPTSRVQLLKRQSCPCQRNDLNEEPQHFTHYAIYDFIVKGSCFCNG 
HA DQCI PV HG FR PV KA PGTFHM V HG KCM (SEQ ID NO: 285), and/or HNTAG 
SHCQHCAPLYNDRPWEAADGKTGAPNECRTCKCNGHADTCHFDVNVWEAS 
GNRSGGVCDDGQHNTEGQYCQRCKPGFYRDLRRPFSAPDACKPCSCHPV 
GSAVLPANSVTFCDPSNGDCPCKPGVAGRRCDRCMVGYWGFGDYGCRP 
CDCAGSGDPITGDCISSHTDIDWYHEVPDFRPVHNKSEPAWEWEDAQGFSAL 
LHSGKCECKEQTLGNAKAFCGMKYSYVLKIKI^ 

KVLKSTKLKIFRGK.ANIISRIMDGO RMHLSNPQSWFGIPCSRT (SEQ ID NO: 
286). Polynucleotides encoding these polypeptides are also provided. 

Included in this invention as preferred domains arc Laminin-type EGF-like 
(LE) domain signatures, which were identified using the ProSite analysis tool (Swiss 
Institute of Bioinformatics). Laminins are the major nqncollagenous components of 
basement membranes that mediate cell adhesion, growth migration, and 
differentiation. They are composed of distinct but related alpha, beta and gamma 
chains. The three chains form a cross-shaped molecule that consist of a long arm and 
three short globular arms. The long arm consists of a coiled coil structure contributed 
by all three chains and cross-linked by interchain disulfide bonds. Beside different 
types of globular domains each subunit contains, in its first half, consecutive repeats 
of about 60 amino acids in length that include eight conserved cysteines. The tertiary 
structure of this domain is remotely similar in its N-terminal to that of the EGF-like 
module. It is known as a 'LE T or 'laminin-type EGF-like* domain. The number of 
copies of the LE domain in the different forms of laminins is highly variable; from 3 
up to 22 copies have been found. A schematic representation of the topology of the 
four disulfide bonds in the LE domain is shown below. < "' > 
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' C: conserved cysteine involved in a disulfide bond 
'a': conserved aromatic residue 

'G 1 : conserved glycine (lower case = less conserved) 
15 's': region similar to the EGF-like domain 
position of the pattern 



' ■ ' * In mouse iaminin gamma- 1 chain, the seventh LE domain has been shown to 

be the only one that binds with a high affinity to nidogcn. The binding-sites are 
?° 20 located on the surface within the loops C1-C3 and C5-G6. Long consecutive arrays of 

f LE domains in lamihins form rod -like elements of limited flexibility » which determine 

the spacing in the formation of Iaminin networks of basement membranes. We 

; _ derived a signature pattern for the LE domain which covers the C^terminal half of the 

35 , , 

repeat starting with the fourth conserved cysteine. The consensus pattern is as 
25 follows: C-x(l,2)-C-it(5>-<^ 
. involved in disulfide bonds] 
40, Preferred polypeptides of the invention comprise the following amino acid 

sequence: CDDCQHNTEGQYCQRCKPGFYRDLRRPFSAPDACKPC (SEQ ID 
NO: 287) and/or CPCKPGVAGRRCDRCMVGYWGFGDYGCRPCDCAGSC(SEQ 
30 ID NO: 288). Polynucleotides encoding these polypeptides are also provided! 

Further preferred are polypeptides comprising the laminin-type EGF-Iike 
, domains listed above, and at least 5, 10, ! 5, 20, 25, 30, 50, or 75 additional 
contiguous amino acid residues of the sequence encoded by this gene. The additional 
50 contiguous amino acid residues is N-terminal or C- terminal to the laminin-type EGF- 



55 



WO 00/06698 * PC1YUS99/17I30 

5 ■ . , < 

like domain Alternatively, the additional contiguous amino acid residues is both N- 
terminal and C-terminal to the laminin-type EGF-hke dornain. wherein the total N- 
10 and C-terininal contiguous amino acid residues equal the specified number. The 

above preferred polypeptide domain is characteristic of a signature specific to. ' 
*. < 5 Laminih proteins. Based on the sequence similarity, the translation. product of this ; 

gene is expected to share at least some biological activities with Lamin in proteins. , 
15 Such activities are known in the an, some of which are described elsewhere herein, 

■ This gene is expressed primarily in osteoblastic tissues, and cell types, 
including osteoblasts, osteoblastomas and osteoclastomas. Expression is also 
20 ; 10 abundant in vascular-pulmonary tissues such as lung, micro- vasculature, pulmonary, 

' enddithelial and smooth muscle cells. 

,\ Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
25 biological sample and for diagnosis of diseases and conditions which include, but are 

' ■ 15 not limited to, cancer and malignancies (particularly of osteoblastic tissues and 

rhabdomyosarcoma), as well as cardiovascular and respiratory or pulmonary disorders 
such as athsma, pulmonary edema, pneumonia, atherosclerosis, restenosis, stoke, 
thrombosis hypertension, inflammauon and wound healing. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
20 probes for differentia I identification of the tissue(s) or cell type(s). For a nurnberof 
35 disorders of the above tissues or cells, particularly of the cardiorespiratory system., 

and skeletal system expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., skeletal, osteoblast, cardio- 
respiratory, vascular, or cancerous and wounded tissues) or bodily fluids (e.g.. iymph* - 
40 25 pulmonary surfactant, serum, plasma, urine, synovial fluid and spinal fluid) or another 

tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily. 
45 . r fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
, 30 , epitopes shown in SF.Q ID NO: 140 as residues: Ser-28 to Cys-34, Thr-5 1 to Thr-58, 
Tyr-64 to Asn-81, Asp-Ill to Lys-i 16, Asp-145 to Phe-160, Pro-203 to Glu-217. 
50 Polynucleotides encoding said polypeptides are also provided. \ ' . 
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The tissue distribution in osteoblastic tissues and cell types and homology to 
laminin indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment, prevention and diagnosis of cardiovasular and respiratory or 
pulmonary disorders such as asthma, pulmonary edema, pneumonia, atherosclerosis, 
' restenosis, stoke, angina, thrombosis hypertension, inflammation and wound healing. 
As a homolog of laminin. this gene product quite possibly has a role in cell adhesion, 
migration, proliferation, angiogenesis, chondrogenesis, wound healing and 1 
oncogenesis. An EST (Int J Cancer 1996 May 16;66(4):57 1-577) with an identical 
sequence to part of this contig was shown to be differentially expressed in human 
primary myoblasts and embryonal rhabdomyosarcoma and therefore might have an 
important role in the determination or maintenance of the normal phenorype, and thus ' 
its loss is possibly involved in the progression of malignancies, particularly of skeletal 
muscle. Similarly, the homology to a laminin would suggest a role in the detection 
and treatment of disorders and conditions afflicting connective tissues (e.g. arthritis, 
trauma, tendonitis, chrondomalacia and inflammation) in the diagnosis or treatment of 
various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
dcrmatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias i.e. spondyloepiphyseal dysplasia 
congenita, familial osteoarthritis, Atelostengenesis type II, metaphyseal 
chondrodysplasia type Schmid. Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the . 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:27 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 



WO 00/06698 '.; PCT/US99/1 7130 

' ■ / 51 

formula of a-b, where a is any integer between I to 3365 of SEQ ID NO:27, b is an 
integer of 15 to 3379, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:27, and where b isgreater than or equal to a + 14. 

"FEATURES OF, PROTEIN ENCODED BY GENE NO: 18 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. ' •." 

In another embodiment, polypeptides comprising the amino acid sequence of 
the.open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the' 
following amino acid sequence: NISSQ YCILKSLEMMISGLFCLLVLFLKFAPENY 
CI^TETT-QMPNRHLRLSKATCYLMKCLLPSYFE (SEQ ID NO: 289). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in placenta, brain, and to a lesser extent, in a 
variety of other tissues and cell types. 

> Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a • 
^biological sample and for diagnosis of diseases and conditions which include, but are . ' 
not limited to, reproductive, behavioral, or nervous system disorders, such as: . .. , 
depression, schizophrenia, Alzheimer's Disease, Parkinson's Disease, Huntington's * 
Disease, dementia, paranoia, addictive behavior, epilepsy, transmissible spongiform 
encephalopathy (TSE), Creutzfeidt-Jaicob disease (GJD). Other diseases.and 
conditions related to expression in the placenta might include developmental 
anomalies and fetal deficiencies, ovarian and endometrial cancers, reproductive 
disfunction and pre-natal disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the^tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous and reproductive systems^ 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., neural, reproductive, or cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial 
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fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the. standard gene expression level, i.e.. the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 141 as residues: Ala-16 to Leu -2 2. Polynucleotides 
encoding said polypeptides are also provided. . 

The ussue distribution in brain indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration " and "Hyperproliferative 
Disorders" sections below, in Example 1 L 15, and 18. and elsewhere herein. Rriefly', 
the uses include, hut are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction; 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS^ 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, expression in placenta would suggest a • 
possible role in the treatment and diagnosis of developmental anomalies and fetal 
deficiencies, ovarian and endometrial cancers, reproductive disfunction and pre-natal 
disorders. 

Similarly, expression within embryonic tissue and other cellular sources ! 
marked by proliferating cells indicates that this protein may play a role in the 
regulation of cellular division. Similarly, embryonic development also involves 
decisions involving cell differentiation and/or apoptosis in pattern formation. Thus, 
this protein may also be involved in apoptosis or tissue differentiation and could again 
be useful in cancer therapy. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate. their interactions, in addition to its use as a ' 
nutritional supplement. Protein, as we I fas, antibodies directed against the protein may 
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show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:28 and may have been publicly available prior to conception of > 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention., To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present- invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1992 of SEQ ID NO:28, b is an 
integer of 15 to 2006, where both a and b correspond to the positions of nucleotide ' 
residues shown in SEQ ID NO:28, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

, , The translation product of this gene shares sequence homology with the 
murine transforming protein (See, e.g., Genbank Accession No. 
gil53529bmblCAA36859. II; all references available through this accession are hereby 
incorporated by reference herein). 

In a specific. embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
. following amino acid sequence: PIEGTPAGTGPEFPGRFTRPQRMRSLISSHPCQ 
HLLLLUiLH-ILAILVDVK^ 

NVY (SEQ ID NO: 290). Polynucleotides encoding these polypeptides arc also 
provided. 

This gene is expressed primarily in activated and basal T-cells. 
' *■ Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immunodeficiency, tumor necrosis, infection, lymphomas, auto- 
immunities, cancer, metastasis, wound healing, inflammation, anemias (leukemia) and 
other hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
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polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 1 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individuaJ having such a disorder, relative to the standard gene 
expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. ^ 

The tissue distribution in activated T-cells and the homology to a murine 
transforming protein indicates polynucleotides and polypeptides corresponding to this 
gene are useful for the diagnosis and treatment of a variety of immune system 
disorders. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 1 1,. i3, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 
lineages, including blood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g., by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou s Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cel! mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and. graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lensc tissue injury, demyeli nation, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells arid committed progenitors of 



WO 00/06698 PCT/LS99/17130 

55. 



various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate igands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
, the specification. Typical of these are cytokine, cell proliferation/differentiation 
modulating activity or induction of other cytokines; 
irmnunosttmulaiin^irrtmunosuppressant activities (e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g. for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g. for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g. for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g. for treating 
haemophilia, cardiac infarction etc.); anti-inflammatory activity'(e.g. for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferative disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate 1 igands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 29 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically ' 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general, 
formula of a-b, where a is any integer between I to 3056 of SEQ ID NO:29, b is an 
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integer of 1 5 to 3070 ; where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:29, and where b is greater than or equal to a + 14. ! v' 

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

, The translation product of this gene was shown to have homology to the Mus 
muscuius ALG-2 protein, which is known to code for a Ca(2+)-binding protein 
' required for T cell receptor-, Fas-, and glucocorticoid-induced cell death. ALG-2 
. mediate Ca(2+ ^regulated signals along the death pathway and may play a role in the 
onset of Alzheimer's Disease (See e.g., Genbank Accession No.gii 12 13520; all 
references available through this accession are hereby incorporated by reference 
herein). . •■. ~* ' . 

Preferred polypeptides comprise the following amino acid sequence: NWVPT 
CLCPSAPCSFFILLSRFKCLFSPQRLTDIFRRYDTDQIXjWlQVSyEQYLSMyF 
IV (SEQ ID NO: 291), and/or QRLTDIFRRYDTDQDGWIQVSYTEQYLSMVFSIV 
(SEQ ID NO: 292). Polynucleotides encoding these polypeptides are also provided. 

When tested against K562 cell lines, supernatants removed from cells 
containing this gene activated the ISRE (interferon-sensitive responsive element). 
Thus, it is likely that this gene activates immune or leukemia cells through the Jaks- 
ST AT signal transduction pathway. ISRE is a promoter element found upstream in 
many genes which are involved in the Jaks-STAT pathway. The Jaks-STAT pathway 
is a large, signal transduction pathway involved in the differentiation and proliferation 
of cells. Therefore, activation of the Jaks : STATs pathway, reflected by the binding of 
the ISRE element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

, A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MFYKLTLILCELSVAGVTQAASQRPLQRLPRHICSQR 
XPPGRCLLKAXLQTTWXXPDKPT PRLSPPLXSDPKR (SEQ ID NO: 293). 
Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
5. Accordingly, polynucleotides related to tHis invention are useful as a marker in 
linkage analysis for chromosome 5. 
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This gene is expressed primarily in placenta, and to a lesser extent, in a variety 
of other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
bio:ogicaI sample anc for diagnosis of diseases and conditions which include, but arc 
not limited to, developmental anomalies, fetal deficiencies ovarian and endometrial 
cancers, reproductive dysfunction and pre-natal disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the reproductive system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell , types (e.g., reproductive, developmental, or cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an * 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 143 as residues: Arg-24 to Arg-31, Ile-33 to Gly-4l. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in placenta indicates that polynucleotides and 
j polypeptides corresponding to this gene are useful for the treatment, prevention 
and/or diagnosis of developmental anomalies, fetal deficiencies, ovarian and ' 
endometrial cancers, reproductive dysfunction and pre-natal disorders. Expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
combined with the observed IS RE activity, and homology to the apoptosis linked. 
ALG-2 indicates that this protein may play . a role in the regulation of cellular division, 
and may show utility in the diagnosis, treatment, and/or prevention of developmental 
diseases and disorders, cancer, and other proliferative conditions. Representative uses 
are described in the "Hyperproliferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 
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Dysregulation of apoptosis can result in inappropriate suppression of cell * 
death, as occurs in the development of some cancers, or in failure ( to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
5 potential ro es in proliferation and differentiation, this gene product may have 

applications in the adult for tissue regeneration and the treatment of cancers. It may 
15 . ' aJso act as a morphogen to control cell and tissue type specification, therefore, the 

polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types . 
10 of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
v ' degenerative or proliferative conditions and diseases. The protein is useful in r ' 

modulating the immune response to aberrant polypeptides, as may exist in 
25 proliferating and cancerous cells and tissues. The protein can also be used to gain new 

15 insight into the regulation of cellular growth and proliferation. Furthermore, the t , 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
30 interactions, in addition to its use as a nutritional supplement. Protein, as well -as, 

,.'\ antibodies directed against the protein may show^utility as a tumor marker and/or 
20 imrnunotherapy targets for the above listed tissues. 
35 Many polynucleotide sequences, sue has EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
* - related to SEQ ID NO.30 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 2213 of SEQ ID NO: 30, b is an 
integer of 15 to 2227, where both a and b correspond to the positions of nucleotide 
30 residues shown in SEQ ID NO:30. and where bis greater than or equal to a + 14. ' 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

The translation product of this gene was shown to have homology to the 
human histo-blood group A transferase (See, e.g., Genbank Accession No. 
gblAAD26573.!!AF1344l3_l (AFI34413): all references available through this 
accession are hereby incorporated by reference herein) which is known to represent 
one of the major allogeneic antigens in both erythrocytes and tissues of humans. Its 
been proposed that trie A phenotype is associated with the glycosyltransferas that ; * 
converts the H substance associated with the O phenotype to A through the addition 
of alpha 1-3- N -acetylgalactosamine or alphal-3-galactosyi residues to the H antigen 
Fuc-alphal -2Gal- betai-R. Therefore, the primary productof the histo-blood group A 
is its respective glycosyltransferase. Preferred polypeptides of the invention comprise 
the following amino acid sequence: TSSPVFSFCSMAVREPDHLQ ' ' * 
RVSLPRYNVSASLQWLPCHRIVLQPWHMCAMWELGQVLFHPVAPREGAAPS 
PV STLTWPSSCSHS ESTM ELELQF (SEQ ID NO: 294), LPCHRIV (SEQ !D NO: 
2%), SLQWLPCHRIVLQPVV(SEQ ID NO: 297), and/or MAVREpbHLQRVSLPR 
(SEQ ID NO: 295). Polynucleotides encoding these polypeptides are also provided. . - 1 
This gene is expressed primarily in 1 2-week-oId human embryo. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental anomalies, fetal deficiencies, pre -natal , disorders, 
hematopoietic disorders, or cancer. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological; probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the developing fetus, expression of this gene at 
significantly higher or lower levels Is routinely detected in certain tissues or cell types 
(e.g., hematopoietic, lymph, developing, or cancerous and wounded tissues) or bodily 
fluids (e.g.; lymph, amniotic fluid, serum, plasma, urine, synovial fluid and spinal ' 
fluid) or another tissue or cell sample taken from an individual having such a disorder; 
relative to the standard gene expression level, i.e.; the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 



WO 00/06698 PCT/US99/1713Q 

: 60 

The tissue distribution in 12 week: old embryo indicates that .polynucleotides 
and polypeptides corresponding to this gene are useful for the treatment and diagnosis 
of developmental anomalies, fetal deficiencies, pre-natal disorders and cancer. 
Similarly, expression within embryonic tissue and other cellular sources marked by 
proliferating cells indicates that this protein may play a role in the regulation of 
cellular division. Similarly, embryonic development also involves decisions involving 
cell differentiation and/or apoptosis in pattern formation. Thus, this protein may also 
be involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Alternatively, the tissue distribution and homology to human blood group A 
and fi glycosy [transferase enzymes indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of hematopoietic 
related disorders such as anemia, pancytopenia, leukopenia, thrombocytopenia or 
leukemia since stromal ceils are importan t in the production of cells of hematopoietic 
lineages. Representative uses are described in the "Immune Activity" and "infectious " 
disease" sections below, in Example 11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the uses include bone marrow cell ex- vivo culture, bone marrow . 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
cecplasia. 

. The gene product may also be involved in lymphopoiesis, therefore, it can be 
used in immune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the 
differentiation and/or proliferation of various cell types. Furthermore, the protein may " 
also be used to determine biological activity, to raise antibodies, as tissue markers, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 4 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences', are publicly 
available and accessible through sequence databases/Some of these sequences are 
related to SEQ ID NO:3 1 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
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excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1274 of SFQ ID NO:31 ; b is an 
integer of 15 to 1288. where both a and b correspond s the positions of nucleotide 
residues shown in SEQ ID NO:31, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

The translation product of this gene shares sequence homology with CD97 
(EMR1), which is thought to be important in both adhesion and signaling processes - 
early after leukocyte activation (See, e.g., Genbank Accession No. gil784994; all 
references available through this accession are hereby incorporated by reference 
herein). EMR1 belongs to a novel family of G-protein receptors that has recently been 
recognized on the basis of homologous primary amino acid sequences, comprises 
receptors to hormones of the secretin/ vasoactive intestinal peptide/glucagon family, 
parathyroid hormone and parathyroid hormone -related peptides, growth hormone- 
releasing factor, corticotropin-releasing factor, and calcitonin. Proteins with seven 
transmembrane segments (7TM) define a superfamily of receptors (7TMreceptors) 
sharing the same topology: an extracellular N-terminus, three extramembranous loops< 
ori either side of the plasma membrane; and a cytoplasmic C-terminal tail. Upon 
ligand binding, cytoplasmic portions of the activated receptor interact with 
heterotrimeric G-coupled proteins to induce various second messengers, which 
subsequently activate various signal transduction pathways depending upon the 
specific G-coupied protein associated with the receptor. Preferred polypeptides of the 
invention comprise the following amino acid sequence: CFKRKPKREHCSCP 
ITYQSlXJDILNASr^SKRKGMQEVKLNSYVVSGTIGLKEKISLSEPVFLTFRHN 
QPGDKRTKHICVYWEGSEGGRWSTEGCSHVHSNGSYTKCKCFHLSSFAVLV 
AI^PKEDPVLTVITQVGLTISLLCLFLAILTFLLCRPIQNTSTSLHLELSLCLP^LA 
HLLFLTG IN RTEPEV LCS f I AGLLH FLY LACFTWMLLEG LH LFLT V RNLKV AN 

YTSTGRFKKRFMYPVGYGIPAVIIAVSAIVGPQNYGTFTHCWLKLDKGFIWSF 
MGPVAVIILINLVFYFQVLWlLRSKLSSLNKEVSTIQDTRVNfTrTCAISQLFTLGC 
SWGLGFFMVEEVGKTlGSIIAYSFrilOTLQGVLLFVVHCLLNRQVRMEYKKW 
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FSGMRKGVETEStEMSRSTTOTKTEEVGKSSEIFIIKGGTASSSAESTKQPQPQ 
1 VHLVSAAWLKMN (SEQ ID NO: 298), and/or FFWKENLRRNGSREDFARRATQ 
UQSVEl^IWNASFASPGKGQISEFDIVYETKRCNETRENAFLEAGNNTMDINC 
ADALKGNLRESTAVALSUNLLGIF SEQ ID NO: 299. Polynucleotides 
encoding these polypeptides are also provided. 

Included in this invention as preferred domains are two EGF-like protein 
domains, which were identified using the ProSite analysis tool (Swiss Institute of 
Bioinformatics). First, a sequence of about forty amino-acid residues long found in 
the sequence of epidermal growth factor (EGF) has been shown to be present in a 
large number of membrane -bound and extracellular, mostly animal proteins. Many of 
these proteins require calcium for their biological function and a caicium-biriding site 
has been found to be located at the N-terminus of some EGF-like domains. Calcium- 
binding is crucial for numerous protein- protein interactions. We have used the N- 
terminal part of the EGF domain as a consensus pattern. It includes the negative N- 
terminus and the possible hydroxylation site. The consensus pattern is as 
fol!ows:[DEQN|.[DEQN]{2}C.{^^ 
involved in disulfide bonds |. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: DINEChTGLAKCKYKAYCRNKVGGYic (SEQ ID NO: 300): 
Polynucleotides encoding these polypeptides are also provided. Secondly, post- 
translational hydroxylation of aspartic acid or asparagine to form erythro-beta- i 
hydroxyaspartic acid or erythro-beta-hydroxyasparagine has been identified in a 
" number of proteins with domains homologous to (EGF)- Based oh sequence 
comparisons of the EGF-homology region that contains hydroxylatcd Asp or Asn, a 
consensus sequence located in the N-terminal of EGF-like domains has been 
identified that seems to be required by the hydroxyiase(s). The consensus sequence is 
as follows: C.|DN ]. {4} rFY ].C.C, 

' Preferred polypeptides of the invention comprise the following amino acid 
sequence: CRNKVGGYICSC (SEQ ID NO: 301). Polynucleotides encoding these ' 
polypeptides are also provided. 

Further preferred are polypeptides comprising the calcium-binding iEGF-like 
domain and aspartic acid and asparagine hydroxylation site listed above, and at least 
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5, 10, 15, 20, 25, 30, 50, or 75 additional contiguous amino acid residues of the 

sequence referenced in Table I for this gene and the embodiments listed herein. The 

additional contiguous amino acid residues is N-terminal or C- terminal to one or both 

of the listed domains. Alternatively, the additional contiguous amino acid residues is 

both N-termmal and OterminaJ to one or both of the listed domains, wherein the total 

and C-terrninal contiguous amino acid residues equal the specified number. The V 

above preferred polypeptide domains are characteristic of a signature specific to EGF 

like proteias. Based on the sequence similarity and conserved domains, The 

',. translation product of this gene is expected to share at least some biological activities 

with EGF-like proteins. Such activities are known in the art, some of which are 

described elsewhere herein. * 

thus gene is expressed primarily in eosinophils. 

' Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differentia! identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include/ but are 

not limited to, hematopoietic disorders or anemias and leukemias, 

inynunodeficiencies; infection, lymphomas, auto-iminuniiies and cancer. Similarly,, 

polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes tor differential identification of the ussue(s) or cell type(s). For 

a number of disorders of the above tissues or cells,' particularly of the immune and 

hematopoietic systems, expression of this gene at significantly higher or lowerlevels ' 

is routinely detected in certain ussues or cell types (e.g., immune, hematopoietic, or 

cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 

synovial fluid and spinal fluid) or another tissue or cell sample taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., . 

the expression level in healthy tissue or bodily fluid from an individual not having the 

disorder. , . 

Preferred polypeptides of the present invention comprise immunogenic 

epitopes shown in SEQ ID NO: 145 as residues: Ser-22 to Ser-30, Pro-33 to Cys-48 ; 

Asp-50 to Lys-67, Pro- 1 17 to Ser- 1 30. Polynucleotides encoding said polypeptides 

are also provided. 
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The tissue distribution in eosinophils combined with its homology 16 a known 
human seven transmembrane domain protein indicates that polynucleotides and 
polypeptides corresponding to 1 this gene are useful for the diagnosis and treatment of 
cancer and other proliferative disorders, particularly considering the fact that the 
majority of 7 transmembrane receptors are tightly associated with signal transduction 
pathways which are integral to the modulation of the cell cycle. As such, the protein 
product of this gene may play a role in the regulation of cellular division, where loss 
of regulation may result in proliferating cells and the onset of tumors or cancer. 
Additionally, the expression in hematopoietic cells and tissues indicates that this 
protein may play a role in the proliferation, differentiation, andVor survival of ; 
hematopoietic cell lineages. In such an event, this gene is useful in the treatment of 
lymphoproliferative disorders, and in the maintenance and differentiation of various 
hematopoietic lineages from early hematopoietic stem and committed progenitor 
cells. Similaily, embryonic development also involves decisions involving cell * 
differentiation and/or apoptosis in pattern formation. Thus this protein may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Similarly, the tissue distribution and homology' to CD97*indicatcs that the 
protein product of this gene might be a marker for differentiation and activation of 
eosinophils, and therefore is useful for the diagnosis and treatment of irnrnune' 
disorders including: leukemias, lymphomas, auto-immunities, immunodeficiencies 
(e.g., AIDS), immuno-supressive conditions (transplantation) and hematopoietic 
disorders; Representative uses are described in the 'Immune Activity" and "infectious' 
'disease" sections below, in Example 11,13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. In addition this gene product is applicable in conditions of general microbial 
infection, inflammation or cancer. Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to , 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. ' 

. Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 32 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or ' 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, wliere a is any integer between 1 to 3266 of SEQ ID NO:32, b is an * ' 
integer of 15 to 3280, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:32, and where b is greater than or equal to a + 14. 1 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

The translation product of this gene has been found to have homology to the 
rat neural F box protein NFB42, in addition to a conserved Cacnorhabditis elegans 
CI4B1.3 protein (See, e.g., Genbank Accession Nos. gil3851648lgb(AAC97505JI 
(AF098301) and gi!558270; all references available through these accessions are 
hereby incorporated by reference herein): Preferred polypeptides of the invention 
comprise the following amino acid sequence: ALCPHPHLILNVTVSPAPSCRHV K 
KVVASPSPSTTMIAMDAPHSKAALDSINELPENrLI^LFraVPARQLLLNCRL ( 
VCSLWRDUDLMTLWKRKCLREGFITKDWDQPVADWKIFYFLRSLHRNLLR 
NPCAEEDMFAWQIDFNGGDRWKVESLPGAHGTDFPDPKVKKYFVTSYEMCL 
KSQLVDLVAEGYWEELLDTFRPDIVVKDWFAARADCGCTYQLKVQLAS A 
DYFVLASFEPPPVTIQQWNNATWTEVSYTFSDYPRGVRYILFQHGGRDTQY 
WAGWYGPRVTNSSIVVSPKMTRNQASSEAQPGQKHGQEEAAQSPYRAyV 
QIF (SEQ ID NO: 302). Polynucleotides encoding these polypeptides are also 
provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
1 . Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1. * 

This gene is expressed primarily in immune cells, especially primary dendritic 
cells, and T cells; and to a lesser extent in a variety of other tissues including breast, 
keratinocytes, epididiymus (cauda), lung, multiple sclerosis, endometrial stromal 
cells, IL4 induced umbilical vein endothelial cells, fetal kidney, fetal dura mater, 
rejected kidney, and osteoblasts. *' ' ■ 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis ot;.diseases and conditions which include, but are 
not limited to, cancer and other proliferative disorders, particularly of the immune 
system or endothelial cells. Similarly, polypeptides and antibodies directed to these - 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an- 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 146 as residues: Pro-41 to Cys-47 t Phe-52 to Gly-59, 
Pro-62 to His-70. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of a 
variety of immune system disorders. Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11,13/14. 
16. 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product in T-cells indicates a role in regulating the proliferation; survival; 
differentiation; and/or activation of hematopoietic cell lineages, including blood stem 
ceils. This gene product is involved in the regulation of cytokine production, antigen 
presentation, or other processes suggesting a usefulness in the treatment of cancer 
(e.g., by hoosting immune responses). 

Since the gene is expressed in ceils of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including anhritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, in flammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities. 
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such as T-ccIl mediated cytotoxicity; immune reactions to transplanted organs and * 
tissues, such as host-versus-graft and graft-versus-hosi diseases,' or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyeiination. systemic * 
lupus erythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover; the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that * * 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of, 
various blood lineages, and in the differentiation and/or proliferation of various cell ' v '. 
types: . - ■ •: • ' . > ' ./ 

The secreted protein can also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate igands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
the specification. Typical of these are cytokine, cell proliferation/differentiation 
modulating activity or induction of other cytokines; 

immunostimulating/immunosupprcssant activities (e.g., for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases a 
regulation of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments arid/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for • 
control of fertility); chembtactic and cherhokinetic activities (e.g. for treating 
infections, tumours); haemostatic or .thrombolytic activity (e.g., for treating - 
haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g., for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferaiivc disease; for regulation of metabolism, behaviour, and many others. - 
Also contemplated is the use of the corresponding nucleic acid in gene therapy ' 
procedures. Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker, and/or immunotherapy targets for the above listed tissues'! 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 33 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b T where a is any integer between 1 to 1283 of SEQ ID NO:33, b is an 
integer of 15 to 1297, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:33. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 24 \ + 

The translation product of this gene shares sequence homology with the . - 
human, mouse, and bovine dopamine hydroxylase which is thought to be important in 
the modification of dopamine, a neurotransmitter (See Genbank Accession Nos. 
gtl30474, gil 162965, and/or gil2358082; all references available through these • 
accessions are hereby incorporated by reference herein). Preferred polypeptides of the 
invention comprise the following amino acid sequence: RQRSWNPGT - / " 
NGY HPNM PDAFLTCETV I FA WA1GG EGFS Y PPHVGLSLGTPLDPH Y VLLEVH 
YDNPTYEEGLIDNSGLRLFYTMDIRKYDAGVIEAGLWVSLFriTi 
QSEGHCTLECLEEALEAEI^SGlHYFAyLLllAHLAGRGI 
LAYDDDroFNhQEFQYLrdEQTILPGDNLITECRYNTlCDRAEMTWGGLSTR : 
' SEMGLSYLLYYPRINLTRCASIPDIMEQLQFIGVKEIYRPVTTVvTniKSPKQYK . .) 
NLSFMDAMNKFKWTKKEGLSFWKLVLSL^ 

SRYRKTL (SEQ ID NO: 303). Polynucleotides encoding these polypeptides are also 
provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MCCWPLLLLWGLLPGTAAGGSGRTYPHRTLLDSEGK 
YWLGVySQRGSQIAFRLQVRTAGYVGFGFSPTGAMASADIVVGGVAHGR 
PYLQDY FTNANRELKKDAQQDYHLEYAMENSTHTliEFTRELHTCDINDKS 
ITDSTV RV I WAY HHE DAGEAGPKYHDSNRGTKSLRLLNPEKTSVLSTALPYF 
DLVNQDVPIPNKDTTYWCQMnCIPVFQEKHHVIKVEPVIQRGHESLyHHILL 
YQCSNNFNDSVPGIRARIAITPTCPN1HSSPV KL (SEQ ID NO: 304) 
Polynucleotides encoding these polypeptides are also provided. 
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v This gene is expressed primarily in brain, the pulmonary system, and to a 
f , lesser extent in kidney. ' • 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents tor differential identification of the tissue(s) or cell tyrje(s) present ina 
5 bialogicai sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurological and behavioral disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differentia! identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues.or ceils, particularly of the nervous system, expression 
' 10 of this gene at significantly higher or lower levels is routinely detected in certain 

tissues or cell types (e.g., neural, endocrine, or cancerous and wounded tissues) or' • 
' bodily fluids (e.g., sputum .lymph, serum, plasma, urine, synovial fluid and spinal 

fluid) or another tissue or cell sample taken from an individual having such a disorder, 
25 .„ , relative to the standard gene expression level, i.e.. the expression level in healthy 

15 tissue or bodily fluid from an individual not having the disorder. 
, , .' ( _ Preferred polypeptides of the present invention comprise immunogenic 

, / epitopes shown in SEQ ID NO: 147 as residues: Ser-33 to Trp-38, Giy-40 to GIy-45, ' 
Asn-93 to Asp-lOS.Thr-UStoThr-nT^lu-lSOtoLys-ie?. Pro-197 to Tyr-203. ' 
Cys-242 to Asn-247, Ser-253 to Tyr-258. His-307 to GIu-314, Glu-357 to Gly-362, . 
. 20 Trp-373 to Gln-378, Ser-402 to Glu-408. Polynucleotides encoding said polypeptides 

i are also provided. * 

w - The tissue distribution in brain and homology to a protein involved in the * 
modification of dopamine indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
25 neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
' . ■ ■ Disorders" sections below, in Example 11,15, and 18. anddsewhere herein. Briefly. 

»; the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease. Parkinson's Disease. Huntington's Disease, Tourcttc Syndrome, \. 
30.. meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
1 trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
50 aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia,' obsessive 
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compulsive disorder, depression; panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
10 patterns, balance, and perception. In addition, the gene or gene product may also play 

a role in the treatment and/or detection of developmental disorders associated with the . 
■5 developing embryo, sexually- linked disorders, or disorders of the cardiovascular 
system. Alternatively, the homology to dopamine hydroxylase indicates that 
15 ' , polynucleotides and polypeptides corresponding to this gene are useful for the 

detection, treatment, and/or prevention of various endocrine disorders and cancers, 
particularly Addison's Disease. Cushing's Syndrome, and disorders and/or cancers of 
the pancreas (e.g., diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, . 
hypopituitarism), thyroid (e.g., hyper-, hypothyroidism), parathyroid (e.g., hyper : ' 
hypoparathyroidism), hypothalamus, and testes. Furthermore, the protein may also ' ' 
be used to determine biological activity, to raise antibodies, as tissue markers, to 
25 isolate cognate iigands or receptors, to identify agents that modulate their interactions, 

15 in addition to its use as a nutritional supplement. Protein, as well as; antibodies 
, directed against the protein may show utility as a tumor marker and/or 
*** ' immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO; 34 and may have been publicly available prior to conception of 
35 the present invention. Preferably, such related polynucleotides are specifically ; 

excluded from the scope of the present invention. To list every related sequence is 
cumbersome.. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
40 25 formula of a-b. where a is any integer between 1 to 2 170 of SEQ ID NO: 34, b is an 

integer of 1 5 to 2 1 84, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:34, and where b is greater than or equal to a + 14; 

45 . , 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 
r 30 When tested against Jurkat T-cell lines, supematants removed from cells 

■* ; ■ * containi ng this gene activated the gamma activating sequence (GAS), a promoter 

50 element found upstream of many genes which are involved in the Jak-STAT pathway. 
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The Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation 'and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate ■ 
proteins involved in the proliferation and differentiation of cells. Thus, it is likely that 
this gene activates T-cells through the JakStat signal transduction pathway. 

This gene is expressed in a variety of human normal and diseased tissues 
including breast, infant adrenal gland, skin tumor, colon,; pituitary, Wilms tumor, and 
to a lesser extent in other tissues. . 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
: not limited to, breast cancer and other proliferative disorders, afflicting endocrine or 
endothelial tissues. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probe's for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the endocrine system or of breast and/or breast lymph 
nodes, expression of this gene at significanUy higher or lower levels is routinely 
. detected in certain tissues or cell types (e.g., reproductive, endocrine: or cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy ursuc or bodily fluid from an individual not having the disorder. 

The tissue distribution in breast, infant adrenal gland, skin tumor, colon, 
pituitary, and Wilm's tumor, and biological activity in activating the GAS promoter 
, element indicates that polynucleotides and polypeptides corresponding to this gene * 
are useful for the detection, treatment, and/or prevention of* various endocrine • 
disorders and cancers, particularly Addison's Disease, Cushing's Syndrome, and 
disorders and/or cancers of the pancreas (e.g., diabetes niellitus), adrenal cortex; 
ovaries, pituitary (e.g., hyper-; hypopituitarism), thyroid (e.g. , hyper-, ' v , 

hypothyroidism); parathyroid (e.g., hyr^hypoparathyroidism) , hypothalamus! and 
testes. Alternatively, the tissue distribution and biological activity indicates that 
polynucleotides'and polypeptides corresponding to this gene are useful for the 
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diagnosis and treatment of cancer and oiher proliferative' disorders. Expression within' 
embryonic tissue and other cellular sources marked by proliferating cells, (i.e., breast, 
skin and Wilm s tumors) indicates that this protein may play a role in the regulation of 
cellular division. Additionally, the expression in hematopoietic cells and tissues 
indicates that this protein may play a role in the proliferation, differentiation, and/or 
survival of hematopoietic ceU lineages. Representative uses arc described in the 
"Immune Activity " and "infectious disease" sections below, in Example '1 1, 13, 14, " 
16, 18, 19, 20, and 27. and elsewhere herein. In such an event, this gene is useful in 
/the treatment of ly mphoprolifcrativc disorders, and in the maintenance and 
differentiation of various hematopoietic lineages from early hematopoietic stem and 
committed progenitor cells. Similarly, embryonic development also involves * 
decisions involving cell differentiation and/or apoptosis in pattern formation. Thus 
this protein may also be involved in apoptosis or tissue differentiation and could again 
be useful in cancer therapy. Furthermore, the protein may also be used to determine , 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition-to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. ' 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases., Some of these sequences are 
related to SEQ ID NO:35 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically * 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
mote polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 935 of SEQ ID NO:35, b is an 
integer of 15 to 949, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:35, and where b is greater than or equal taa +14.' 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 26 
. In another embodiment, polypeptides comprising the amino acid sequence of 

10 the °P en rcadi "g frame upstream of the predicted signal peptide are contemplated by ' 

the present invention'. Specifically, polypeptides of the invention comprise, the 
5 following amino acid sequence: TGTFWSPRSQRRGCCGRRAPRPEAMENGAVYS 

PTTEEDPGPARGPRSGLAAYFFMGRLPLLRRVLKGLQLLLSLLAFICEEVVSQ 
CTLCGGLYFFEFVSCSArT-LSLLILIV 

FLLAS irFVSTI I DRTS AE1A AI VFGFIASFMFLLDFTI^L YEKRQESQLRKPENTT 
: R AEALTEPLN A (SEQ ID NO: 305). Polynucleotides encoding these polypeptides 
10 are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
3. Accordingly , polynucleotides related to this invention arc useful as a marker in 
linkage analysis for chromosome 3. ' 1 - 

* This £ ene is expressed primarily in dendritic cells, and to a lesser extent in 

1 5 melanocytes, fetal liver and spleen and several other tissues. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
b:c!cg:ca: sample and for diagnosis of diseases and conditions which include, but are 
not limited to, inflammation, and disorders of the hepatic and immune systems. 
20 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
35 providing immunological probes for differential identification of the tissue(s) or cell 

type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and hematopoietic systems, expression of this gene at'significantiy higher or 
lower levels is routinely detected in certain tissues or cell types (e.g.. hematopoietic, 
25 hepatic, immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum; plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
45 ' expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 

individual not having the disorder. 
30 Preferred polypeptides of the present invention comprise immunogenic 

epitopes shown in SEQ ID NO: 149 as residues: Phe-63 to Ser-75, Thr-97 to Ser- 102, 
50 * Glu- 1 28 to Arg- 143. Polynucleotides encoding said polypeptides are also provided. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders. Representative uses arc described in the "Immune 
Activity" and "infectious disease" sections below, in Example 1 1, 13, 14, 16, 18, 19, 
5 y 20. and 27, and elsewhere herein. Briefly, the expression of this' gene product in 
dendritic cells indicates a role in the regulation of the proliferation; survival; 
differentiation; and/or activation of potentially all hematopoietic cell lineages, 
including blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes that may also suggest a usefulness 
0 in the treatment of cancer (e.g. by boosting immune responses). 

. " Since the gene is expressed in cells of lymphoid origin, the natural gene ' 
product is involved in immune functions. Therefore it is also used as an agent for ' 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou s Disease, inflammatory 
5 - bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis,' hypersensitivities, 
such as T-ccll mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis. Sjogren's 
Disease, scleroderma and tissues. More over, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
^ recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem ceils and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the tissue distribution in fetal liver indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the detection and treatment 
. of liver disorders and cancers (e.g., hepatoblastoma, jaundice.- hepatitis, liver 
metabolic diseases and conditions that are attributable to the differentiation of 
hepatocytc progenitor cells). In addition the expression in fetus would suggesi a' 
useful role for the protein product in developmental abnormalities, fetai deficiencies, 
pre-natal disorders and various would-heaiing models and/or tissue trauma. , 
Furthermore, the protein may also be used to determine biological activity, raise 
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antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

5 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are , 
related to SEQ ID NO:36 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 

10 ' cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3324 of SEQ ID NO:36, b is an 
integer of 15 to 3338, where both a and b correspond to the positions of nucleotide 
, residues shown in SEQ ID NO: 36, and where b is greater than or equal to a + 14. 

15 ■ • 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 

20 following amino acid sequence: ASAPRVMRGHLAGFPALSGLAS VCLWATFSA 
QLPGPVAATSWTPAPLGCSAARSGPEKRLGTAAPGSAASLAQAGPGAPCRV 
LPVDPAPAALNVREPGWl^GLFDGALLQVlXNFLRKSTDVLMDTRE t \ESLEV 
E (SEQ ID NO: 306). 

In another embodiment polypeptides of the invention comprise the following 

25 amino acid sequence: NKLIISFPVFLSQLLLDRQLLHAPQTLPTPHCGGSSRPGP 
SHPPVVTXIQLPCVHVALWQMLRDFSDSRnTSTLTTQPAAQTAAPAKDQES 
DrVGGEGILCDlAFLX^EDHPLGVGGASAPSSRRELSRRGVHTQTLPElXjTLHG 
TPSSSFDCGIKYIISWPLAPGCDLPSLELSLVCKGVSSCMGFAAG (SEQ ID NO: 
307). Polynucleotides encoding these polypeptides are also provided. 

30 This gene is expressed primarily in endothelial cells, lung, and fetal kidney, 

and to a lesser extent in epididymis, keratinocytes and cerebellum. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a * 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cardiovascular diseases involving endothelial cell disturbances such as 
atherosclerosis. Similarly, polypeptides and antibodies directed to these polypeptides 
' are useful in providing immunological probes for differcritial identification of the 
Jissue(s) or cell type( s ). For a number of disorders of the above tissues or cells, 
particularly of the cardiovascular system, expression of this gene at significantly 
higher or lower levels is routinely detected in certain tissues or cell types (e.g., 
cardiovascular, or cancerous andwounded tissues) or bodily fluids (e.g.; lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. , 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 150 as residues: Arg-47 to Leu-54. Polynucleotides 
encoding said polypeptides are also provided. 

.The ussue distribution in endothelial cells indicates that, polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosing and treating 
disorders of endotheliai cells such as athcroschlerosis, vasculitis, cardiovascular 
disease, and emphysema. The secreted protein can also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions and as nutritional 
supplements. The polypeptide may possess a wide range of undetected biological 
activities. Typical of these are cytokine, cell proliferation/differentiation modulating 
activity or induction oiother cytokines; immunostimulatmg/irrimunosuppressant 
activities (e.g.. for treating human immunodeficiency virus infection, cancer, 
autoimmunediseascs and allergy); regulation of haematopoiesis (e.g., for 
trcatinganaemia or as adjunct to chemotherapy); stimulation of growth of 
bonexartilage, tendons, ligaments and/or nerves (e.g., for treating wounds.siimulation 
of follicle stimulating hormone (for control of fertility); chemotactic and 
chemokinetic activities (e.g., fur treating infections, tumours): haemostatic or 



WO 00/06698 



PCT/US99/ni30 



thrombolytic activity (e.g., for treating haemophilia, cardiac infarction etc.); anti- 
inflammatory activity (e.g., for treating septic shock, Crohn's Disease); as 
antimicrobials; for treating psoriasis or other hyperpnoliferative disease; for regulation 
of metabolism, behaviour, and many others. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 
directed against the protein may show utility as a rumor marker and/or 
1 immunotherapy targets for the above listed tissues 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases: Some of these sequences are 
related to SEQ ID NO:37 and may have been publicly' available prior to conception of s 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1549 of SEQ ID NO:37, b is an 
integer of 15 to 1563, where both aand b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:37, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN KNCODED BY GENE NO: 28 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by . 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: PGRPTRPTKNK VC VCLGM LFW A YPIC VFIDS L 
SCQPCLWSTGATSHFNSPTTSPLFTLFMPCALAPNPFT QLGKLDDR (SEQ ID 
NO: 308). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in meningima. 

Therefore, polynucleotides and polypeptides of the invention are useful as - 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, tumors or disorders of the central nervous system. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differenual identification of the tissues) or ceil type(s). For 



WO 00/06698 PCT/US99/17130 

. • 78 , . 

j • ■ ■■ ' r 

, a number of disorders of the above tissues or cells, particularly of the central nervous 
system, expression of this gene at significantly higher or lower levels is routinely,' 
. detected in certain tissues or cell types (e.g., neural, or, cancerous and wounded '\ 
" * tisSUes) or b0 <% fluids (e.g;, lymph; semm, plasma, urine, synovial fluid and spinal 
5 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. * 

Preferred polypeptides of the present invention comprise, immunogenic 
; epitopes shown in SEQ ID NO: 151 as residues: His^ toThr-34. Polynucleotides 
3 encoding said polypeptides are also provided. 

The tissue distribution in mcningima indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflarnmatory 
conditions. Representative uses are described in the "Regenera^on ,, and 
/■/."Hyperprolif^ ^ ^ 

elsewhere herein. Briefly, the uses include, but are not limited to the detection, 
treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Hvr.tir.g'or/s disease, Tourette Syndrome, meningitis, encephalius, demyeiinating 
diseases, peripheral neuropathies, neoplasia, trauina, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors including < , : 
disorders in feeding, steep patterns, balance, .and perception, as well asdisorders of 
the meninges such as meningioma and meningitis. In addition, the gene or gene 
product may also play a role in the treatment and/or detection of developmental 
disorders associated with the, developing embryo, sexually-linked disorders, or 
disorders of the cardiovascular system. Funhermore, the protein may also beused to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement , Protein, as well us, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 
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Many polynucleotide sequences/such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 38 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general , 
formula of a-b, where a is any integer between 1 to 1 034 of SEQ ID NO: 38. b is an 1 
integer of 15 to 1048, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO:38, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

The translation product of this gene has been shown to encode a human brain 
specific mitochondrial carrier (Genbank Accession No. gi(385 1540lgblAAD0434<x II 

1 5 (AF078544); all references available through this accession are hereby incorporated 
herein by reference) which shares sequence homology with the human body weight 
disorder associated gerie C5 product which is known to be differentially expressed in 
obese compared to lean mice (See GcneSeq Accession No. R9I2S1)! Based on the ' 
sequence similarity^ the translation product of this gene is expected to share at least 

20 ♦ some biological activities with mitochondrial carriers proteins. Such activities are ' j 
known in the art, some of which are described in Sanchis et al, J. Biol. Chem. 
; 273:346 11-34615 (1998), incorporated herein by reference. 

Included in this invention as preferred domains are mitochondria] energy 
transfer protein (METP) domains, which were identified using the ProSite analysis 

25 tool (Swiss Institute of Bioinformaiics). Structurally, members of the family of 

mitochondrial energy transfer proteins consist of three tandem repeats of a domain of 
approximately one hundred residues. Each of these domains contains two 
transmembrane regions. As a signature pattern, we selected one of the most conserved 
regions in the repeated domain, located jusrafter the first trans me mbrajie region. To 

30 detect this widespread family of proteins, a consensus sequence was developed that 
contains the most conserved regions in the repeated domain! The consensus pattern is • 
as follows: PTDE].[UVAT1[RKJ.[LRH][LIVMFY1[QMAIGV]. 
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Preferred polypeptides of the invention comprise the following amino acid 
sequences: PVDLTKTRLQ (SEQ ID NO: 309) arid PTDVLKIRMQ (SEQ ID NO: 
310). Polynucleotides encoding, these polypeptides arc also provided. 

( . Further preferred are polypeptides comprising the METP domains of the 
5 sequence listed above, and at least 5. 10. 15. 20, 25, 30, 50, or 75 additional 

contiguous amino acid residues of the sequence referenced hi Table I for this gene. 
; , The additional contiguous amino acid residues is N^e^^ 

METP domain. Alternatively, the .additional contiguous amino acid residues is both 
N-terrninal and C-terminal to the METP domain, wherein the total N- and C-terminal 
0 contiguous amino acid residues equal the specified number. The above preferred - 
polypeptide domain is characteristic of a signature specific to muochondrial energy 
• transfer proteins. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
X. Accordingly, polynucleotides related to this invention arc useful as'a marker in 
. linkage analysis for chromosome X. 

This gene is expressed primarily in brain, and to a lesser extent, in T-cells. , 
Therefore, polynucleotides and polypeptides of the invention are useful as 
«ajKits.for differentia: identification of.the tissiic(s)-orcell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurological and behavioral disorders and immune disorders and/or 
obesity. Similarly, polypeptides and antibodies directed to these polypeptides are' 
useful in providing immunological probes for differential identification of the 
., tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
.particularly of the disgestive, immune, and nervous systems, expression of this gene 
at significantly higher of lower levels is routinely detected in' certain tissues or cell 
types (e.g:. immune, neural, or cancerous arid Wounded tissues) or bodily fliiids (e!g., 
lymph, serum. plasrha. urine, synovial fluid and spinal fluid) or another tissue or cell 
sample laken from an individual having such'a disorder, relative to 'the standard gene ' 
expression level, i.e., the.expression level in healthy tissue ; or bodily fluid from an 
individual not having the disorder. " 
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Preferred polypeptides of the present invention comprise immunogenic * 
♦ ; , • e P«opcs shown in SEQ ID NO: 152 as residues: Glh-189 to Gly- 195. Polynucleotides . 

10 . / coding said polypeptides are also provided. 

The tissue distribution in brain and homology to 'mitochondrial carrier proteins 
5 indicateipolynudeotides and polyijeptides corresponding to,this gene arc usefulfor 
tiie detection, treatment, and/or prevention of neurodegenerative disease states, 
behavioral disorders, or inflammatory conditions. Representative uses are described in 
. the "Regeneration 1 and 'Hyperproliferauve Disorders" sections below, in Example 
Il,<15, and 18, and elsewhere herein. Briefly, the uses include, but are not li railed to 
the detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's' ' - 
Disease, Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, 
demyelinating diseases, peripheral neuropathies, neoplasia, trauma, congenital >. 
malformations, spinal cord injuries, ischemia and infarction, aneurysms, hemorrhages, 
schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder, depression, 
panic disorder, learning disabilities, ALS, psychoses, autism; and altered behaviors, 
including disorders in feeding, sleep patterns, balance, and perception. In addition, 
elevated expression of this gene product in regions of the brain indicates it plays a 
role in normal _~ura! function. 

, Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
35 . . survivaL Funhermorc, the protein may also be used to Seierminc biological activity, 

to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate iheir interactions/ in. addition to its use' as a nutritional- 
supplement, Protein, as well as, antibodies directed against the protein may show ^ 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues/ 

Many polynucleotide sequences, such as EST sequences, are publicly 1 
available and accessible through sequence databases. Some of these sequences are 
45 ' rclatCd t0 S ^ ID NO:39 and "}W nave been publicly available prior to conception of 

the present invention. Preferably, such related polynucleotides are specifically '\' ■ 
excluded from the scope of die present invention: To list every^eiatediequence'is • " 
cumbersome. Accordingly , preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general " ' 
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formula of a-b, where a is any integer between 1 lo 14 16 of SEQ ID NO:39 ( b is an 
integer of 15 to 1 430. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 39, and where b is greater than or, equal to a + 14. " * 

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: NTTFGSTISPTSTHASPSLGFCCSWLLEDLEEQLYCSAFEEAALTR 
* RICNITSCWLPLDMELLHRQVI^ 

GSRCLLTASHPCTEKRRKASAXQRNLGYPLAMLCLLVLTGLSVLIVAIHILEL 
LIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQVVLIFYLMVSSVVGFY 
RSLRPRWHDTAN1TQ[IGNCVCLLVLSSAU>VFSRTLGLTRFDLLGDFGRFNWL 
GNFYlVn.YNAAFAGLTTLCLVKTmAVRAELrRAFGLDRLPLPVSGFPQA^ 
' RKTQHQ (SEQ ID NO: 311): Polynucleotides encoding such polypeptides are also ' 
provided. 

This gene is expressed primarily in immune system tissues (e.g.' resting T-. ■, 
cells, primary dendritic cells, and neutrophils, apoptotic T-cells) and umbilical vein! 
This gene is expressed to a lesser extent in the gastrointestinal tissue (e.g. small 
i=ttsti-e. colon), brain (e.g. cerebellum, frontal cortex), aorta endothelial cells, skin 
tumor, embryonic tissue, thymus, and cancers (e.g. cheek, breast, synovial). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissues) or cell type(s) present in a , ' 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and immune disorders: Similarly, polypeptides and antibodies 
directed to these.polypeptides are useful in providing immunological probes for 
differential identification of the tissued) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system and gastrointestinal tract 
expression of this gene at significantly higheror lower levels is routinely detected in 
certain tissues or cell types (e.g.. immune, gastrointestinal, cancerous and wounded • 
tissues) or. bodily fluids (e,g.,' amniotic, serum, plasma, urine, synovial fluid and 
spinal fluid) or another tissue or cell sample taken from an indi vidual ha ving such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of , he present invention comprise immunogenic 

10- - encoding said polypeptides are also provided. 

„ Thetissue distribution in immune cells (e.g. T-cells, dendritic cells 

• 5. neutrophils) indicates Polynucleoudes and po.vpept.des corresponding to this gene are 

■ useful for the diagnosis and treatment of a variety of immune system d.sorders 
. Reptcsenadve uses are described* ,he ^ ^ 

sections below, in Example 11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein 

Bncfly, the expression of this gene product indicates a role in regulating the 
'Q proliferate; survival: differentiation; anoVor activation of hematopoietic cell • 

imeages, including blood stem cells. This gene product is involved in the regu.ation of 
'. ^"^^-"^' ^^P^entauon. oro.her processes suggesting a usefulness 
n the treatment of cancer (e.g. by boosting immune responses,. 
... • Since the gene is express^ in cefe of lymphoid origin, the natural-gene 

produc t K,i n volvedini m munefunc«ion.s.Therefo re itisalsouseful;asana g e«f 0 r ' 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
asAIDS. ieukemia. rheumatoid arthritis, o^nutorm.ou's Disease, inllammatory 
; bowel dtsease. sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities • 
such as T-cel. mediated cytotoxicity; .mmune reactions .o transplanted organs and . 
Ussues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 

. D.se M e. and scleroderma. Moreover, the protein may represent a secreted factor tha, 
' ' nfl «««e S the differentiation or behavior of other blood cells, or that recruits 

25 ■h^ieuccelteto^^ 

the expansion of stem cel.s and committed progenitors of various blood lineage, and 
■n the differentiation and/or proliferation of various cel. types. The t.ssue diaributioa 

« , r km ^° re ' andcanc ™ 

ih«poly™,c!e^ . 

dtagnosisandtreatrnentofcancerandotherproliferat.vedisorders.Expressionin " 
• ' ^sources a-ch as embryonic tissue marked hy pMiferating ceJ.s indict chic 

50. *« pro** may play a ro, e in . ^Mon of cellular division. Additionally the ' 
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. ■ ■ expressions hematopoietic cells and tissues indicates that this protein may play a 

role in the proliferation, differentiation, and/or survival of hematopoietic cell lineages 
10 . • ^ SUCh an eVem - ,his is "^ful in the treatment of lymphoproliferatjve disorder 

and in the maintenance and differentiation of various hematopoietic lineages f rom 
5 early hematopoietic stem andcortmuied progenitor cells, SimiMy.- einbryonic''.'"- 
1$ ' ' deVCl °P ment ^ involve, decisions involving cell differentiation and/or apoptosis in 

, • pattern format,on. Thus this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in.cancer therapy. Protem, as well as, 
. antibodies directed against the protein may show utility i a tumor marker and/or 7 

20 ,. 10 ;'.™m™«lierapy ; targe K for the above listed tissues. The tissue distribute, in ".. 

7. . cerebellum andfrontai coitcx indicate* polynucleotides and polypeptides ... * ' 

corresponding to this gene axe useful for the detection, treatment! and/or prevention of. 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproiiferative 
Disorders" sections below, in Example 1 1, 15, and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection, treatment, and/or prevention of ' 
Alzheimer's Disease. Parkinson's Disease. Huntington's Disease. Tourecte Syndrome! 
me.-r.gitis. encephalitis, demyeUnating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malfonnations, spinal cord injuries, ischemia and infarction, 
aneurysms , hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
35 compulsive disorder, depression,' panic disorder, learning disabilities, ALS, 

psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
'.patterns, balance, and perception,' In additioi elevated expression of this gene product 
in regions of the brain indicates it plays a role' in normal neural function.; 
25. Potentially, this gciie product is involved in synapse formation, 

neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
45 ' * raise-amibbdies. as tissue markers, to isolateccW ligands or : receptors, to ' 

identify agents that modulate their interactions. i„ addition to its use as a nutritional 
., 30 supplement. Protein, as well as, antibodies directed against the protein may show 

, utility as a tumor marker.and/or immunotherapy targets for the above listed tissues 

50 ' ■ • ■■' ■ . 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

- ■ related to SEQ 1 D NO 40 and may have been publicly available priorto conception of 
the present invention: Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present* invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 

, formula of a-b, where a is any integer between 1 to 2089 of SEQ ID NO:40, b is an 
integer of 15 to 2103, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:40, and where b is greater than or equal to a +14. * 

FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

The polypeptide of this gene has been determined to have a zinc finger ( Zinc finger, 
C2H2 .type) domain at about amino acid position 16-50 of the amino acid sequence 
referenced in Table. I for this gene: Therefore, 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: LCVCLVYLCMYGVCLCVIVCVSGVSLCLYVWGVSVC 
DCVSVFMCVGLCVIFCVYGKPRTEHYHSPHLAKQKAraEMCGRHDVSAAGIF 
QSYV (SEQ ID NO: 3 12).. Polynucleotides encoding these polypeptides are also 
provided. 'Zinc finger' domains are nucleic acid-binding protein structures first 
identified in the Xeriopus. transcription factor TFIIIA. These domains have since been 
found ^numerous nucleic acid-binding proteins. A zinc finger domain is composed 
of 25 to 30 amino-acid residues. There are two cysteine or histidine residues at both 
extremities of the domain, which are involved in the tetraherlral coordination of a zinc 
atom. Ir/has been proposed that such a domain interacts with about five nucleotides. A 
schematic representation of a zinc finger domain is shown below: 
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Many classes of zinc fingers are characterized according to the number and 
positions of the histidine and cysteine residues involved in the zinc atom coordination. 
In the first class to be characterized, called C2H2, the first pair of zinc coordinating 
residues are cysteines; while the second pair are histidines. A number of experimental 
reports have demonstrated the zinc- dependent DNA or RN A, binding property of 
some members of this class. Some of the proteins known to include C2H2-type zinc 
fingers are listed below. We have indicated, between brackets, the number of zinc 
finger regions found in each of these proteins; a V symbol indicates that only partial 
. sequence data is available and that additional finger domains is present - 
This gene |s expressed primarily in salivary gland. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample^ and for diagnosis of diseases and conditions which include, but are 
not limited lo, salivary gland related diseases, diseases of the mouth, and other s 
digestive disorders. Similarly, polypeptides and antibodies directed to these ' 
■. polypeptides are useful in providing immunological probes for differential 
identification of the tissues) or cell typeCs). For a number of .disorders, of the above 
tissues or cells, particularly of the digestive system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily' fluids (e.g., saliva, serum, plasma, 
urine, synovial fluid and spinal fiuidydr another tissue or cell sample taken from an < 
individual having such a disorder^ relative to the standard gene expression. level, i.e., 
the expression levej in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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Preferred polypeptides of" the present' invention comprise immunogenic 
' ep " 0peS Sh ° wn in SE< 3 IP 154 as residues: Gly-46 to His-54. Polynucleotides 
encoding said polypeptides are' also provided. 

, The tissue distribution indicates that the protein products of this gene are ''. 
useful for diagnosis and treatmcm of salivary gland related diseases (mumps, calculi 
formation in ducts, sarcoidosis, facial palsy, tumors, Sjogren* Syndrome) and other 
digestive system disorders. Furthermore, the protein may also;be used to determine 
biological activity, raise antibodies, as tissue markers, to isolate cognate ligands or " 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the proteiamay 
show utility as a tumor marker and/or unmunothcrapy targets for the above listed' 



tissues. 



: Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are • 
related to SEQ ID NO*! and may have beet, publiciy available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded fronuhe present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general ; 
formula of a-b. where a is any integer between 1 to 2335 of SEQ ID NO:4 l , b is an 
integer of 15 to 2349. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:41, and where b is greater than or equal to a i 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

This gene is expressed primarily in fetal tissue (t.g. spleen, liver, brain), 
cancerous tissues (e.g. ovarian, colon, stomach, parathyroid) and to a lesser extent in ' 
. immune cells and tissue (e.g. B-cells, T-cells. bone marrow'), ahd reproductive organs. \ 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present m a - 
biological sample and for diagnosis of diseases and conditions which inciude. but are 
not limited to. cancer, particularly of the colon and ovaries, disorders of the 
developing fetus, neurodegenerative conditions, and immune system disorders.' 
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Similarly, polypeptides and antibodies directed to these polypeptide* ore useful in 
Priding immunological probe., for differential, identification of the ti S sue(s) or cell 
lype(s). For a . number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., irnrhune, reproductive, neural 
cancerous and wounded tissues) or bodily fluids <e t g., serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an' individual having 
such a disorder, relative to the standard gene expression level; i.e., the expression 
, level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic • 
epitopes shown.ih SEQ ID NO: 155 as residues: Lys-35 to Lys-47. Polynucleotides ' 
encoding said polypeptides are also provided. .' ' . 

The expression of this gene within fetal tissue and other cellular sources marked by . 
proliferating cells indicates this protein may play arolc in the regulation of cellular , 
5. -division, and may. stow utility in the diaj^i^'treatment.Mdfar pontic of ' 
developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the 'Hyperproiiferative Disoolen" 
^Regeneration" 'sections be-oW and elsewhere herein. Briefly, developmental : 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
!0 formation. * 

• Dysregulation of apoptosis can resuil in inappropriate suppression of cell - . 

death, as occurs in the development of some cancers; or in failure to control the extent' 
Qf cell death, as is believed to occur.in acquired immunodeficiency , and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles m-Rrolifcnuton and differentiation, th^gene-product may have • • ' 
applications in the adult for tissue regeneration and the trcatmenwf cancers. It may .'. 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention ore useful in treating. ' 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
' differentiation and is useful in the detection, treatment, and/or prevention of . ' 
degenerative or proliferative conditions and diseases. The'protcin is useful in 
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modulating ihe immune response to aberrant polypeptides, as may exist in' - 
: proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferate. The tissue distribution 
in immune cells (such as T-ceJis and B-cells) and immune tissues (bone marrow) 
5 indicates polynucleotides and polypeptides corresponding to this gene are useful for 
the diagnosis and treatment of a variety of immune system disorders. Representative 
uses are described in the ."Immune Activity" and "infectious disease" sections below, . 
.in Example 1 1, 13, 14, 16,. 18, 19, 20, and 27, and elsewhere herein. Briefly; the 
expression of this gene product indicates a role in regulating the proliferation; 
10 survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

. Since the gene is expressed in cells of lymphoid origin, the natural gene 
15 product is involved in immune functions. Therefore it is also useful as an agent for ' 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
0 tissues, such as host-versus-graft and graft-versus-host diseases, of autoimmunity 
. disorders, such as autoimmune infertility, lense tissue injury demyeiinauon, systemic 
lupus erythematosis, drug induced hemolytic anemia,' rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
5 hematopoietic ceils to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/ur proliferation of various ceil types. The tissue distribution 
in parathyroid indicates polynucleotides and polypeptides corresponding to this gene 
are useful for the detection, treatment, and/or prevention of various endocrine 
disorders and cancers. Representative uses are described in the "Biological Activity", 
"Hyperproliferative Disorders", and "Binding Activity'' sections below, in Example 
1 1, 17, 18, 19, 20 and 27. and elsewhere herein. Briefly, the protein can be used for 
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the detection/treatment, and/or prevention of Addison's Disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes melliius), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism)., parathyroid (e.g. hyper-, hypoparathyroidism) . hypothalamus, and 
testes. Additionally, the tissue distribution in brain tissue indicates polynucleotides 
> and polypeptides corresponding to this gene are useful for the detection, treatment,, 
and/or prevention of neurodegenerative disease states, behavioral; disorders, or 
Inflammatory conditions. Representative uses are described in the "Regeneration" and 
"Hyperproliferative Disorders" sections below, in Example 1 h 15; and 18, and 
elsewhere herein. Briefly, the uses include, but arc not limited to the detection, • 
treatment, and/or prevention of Alzheimer's Disease. Parkinson s Disease, 
Huntington's Disease,, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma/congenital malformations; spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, J ; 
mania, dementia, paranoia; obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene procuctin regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition; homeostasis, or neuronal; differentiation or ' 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate iigands or receptors, to 
identify, agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
t utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:42 and may have been publicly available prior to conception of ' 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every- related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
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more polynucleotides comprising a nucleotide sequence described by the general. 
, formula of a-b, where a is any integer between I to 1545 of SEQ ID NO:42, b is an 
integer of 1 5 to 1 559, where both a and b correspond to the positions of nucleotide 
, residues shown in SEQ ID NO:42, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

When tested against U937 Myeloid cell lines, supernatant* removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
; myeloid cells through the Jak-STAT signal transduction pathway. The gamma 
activating sequence (GAS) is a promoter element found upstream of many genes- 
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of. the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

This gene is expressed primarily in skin tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents fcr differentia: identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, skin disorders, particuiary skin cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissues) or cell type(s). For a number of 
disorders of the above tissues or cells/ particularly of the skin, expression of this gene 
at significandy higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., cancerous and wounded tissues) or bodilyiluids (e.g., serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

. the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. . J 

Preferred polypeptides of the present invention comprise immunogenic 

epitopes shown in SEQ ID NO: 156 as residues: Pro-38 to Gly-44. Pheo6 to Thr-64. • 

Polynucleotides encoding said polypeptides are also provided. 
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. * The tissue distribution in skin indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the treatment, diagnosis, and/or prevention 
10 of various skin disorders including congenital disorders (i.e.. nevi, moles, freckles, 

Mongolian spots, hemangiomas, port-wine syndrome )V integumentary tumors (i.e. 
5 keratoses, Bowen's Disease, basal cell carcinoma, squamous cell carcinoma! 

malignant melanoma, Paget's Disease, mycosis fungoides, and Kaposi's sarcoma), 
injuries and inflammation of the skin (i.e. wounds, rashes, prickly heat disorder, 
psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, autoimmune 
disorders (i;e. lupus erythematosus, vitiligo, dermatomyositis, morphea, scleroderma, 
pemphigoid, and pemphigus), keloids, striae, erythema, petechias purpura, and 
xanthelasma. Moreover, such disorders may predispose increased susceptibility to 
viral and bacterial infections of the skin (i.e. cold sores, warts, chickehpox, 
molluscum comagiosum, herpes zoster, boils, cellulitis, erysipelas; imr^tigo. dnea, 
25 ' ° althletes foot, and ringworm). Protein, as well as, antibodies directed against the 

15 protein may show utility as a tumor marker and immuno therapy targets for the above 
listed tumors and tissues, , ; . 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences arc - 
' t related to SEQ ID NO:43 and may have been publicly available prior to conception of 
20 the present invention. Preferably, such related polynucleotides are specifically 
35 excluded from the scope of the present invention. To list every' related sequence is ' ' 

cumbersome: Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 752 of SEQ ID NO:43. b is an 
integer of 15 to 1766, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NC):43, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

The translation product of this gene shares sequence homology with mitogen- 
induced prostate carcinoma (mouse)- which is thought to be important in the etiology 
of c ^cer. In this respect, this gene is mitogen- induced and/or involved in cell 
50 proliferation. * ' 
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; Preferred polypeptides of the invention comprise the following amino acid 
sequence: GHMPYGWLTEIRAVTPAFDKNW - 
TFVCMALSLTLCr^MTOTPNVSEK^^ 
VRAHLTGWLMTLKiCTFVLAPSSVLRIIVLMSLVyLPYLG 
GFVGESTMVAlAACYVy^QKKKMENESATEGEDSANTTDMPFTEE 
MREENE (SEQ ID NO: 313) and/or Q V VFV A1LLH SHLECREPLLIPILSL YMG A 
LVRCTTLGLGYYK^IHDIlPDRSGPELGGDAnRKiMLSFV^ 
PIVNLFVSRDLGGSSAATEAVAILTATYPV (SEQ ID NO: 314). Polynucleotides 
encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on. chromosome. 
5. Accordingly, polynucleotides related to this invention are useful as a marker in * 
linkage analysis for chromosome 5. 

; This gene is expressed primarily in early infant and adult brain, retina, fetal 
tissue (e.g., liver, speen, whole embryo) and to a lesser extent in immune cells (e.g., 
monocytes and T-ceils), colon, and parathyroid tumor tissue . 

Therefore, polynucleotides and polypeptides 1 of the invention are useful' as 
reagents for differential identification of the tissue(s) or cell rype(s) present in a • 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancers, disorders of the immune system and nervous sy stem- 
Similarly, polypeptides and antibodies directed to these polypeptides are useful io 
providing immunological probes for differential identification of the tissuc(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
metabolic system (cancers), expression of this gene at significantly higher or lower 
levels is routinely detected in certain tissues or ceil types (e.g.; immune, neural, 
cancerous and wounded tissues) or bodily fluids (e.g.; serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e.; the expression, 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 157 as residues: Arg-122 to Ser- 139,- Mel- 144 to Glu-. 
149. Polynucleotides encoding said polypeptides are also provided. 
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The tissue distribution and homology to mitogen induced prostate carcinoma 
(mouse) indicates that poly nucleotides and polypeptides corresponding to this gene 
are useful for the study and treatment of cancers, including but not limited to the 
colon, parathyroid, and adrenal glands. Moreover, the expression within fetal tissue \ 
and other cellular sources marked by proliferating cells indicates this protein may 
play a role in the regulation of cellular division, and may show utility in the diagnosis, 
treatment, and/or prevention of developmental diseases and disorders, including' " 
cancer, and other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders'* and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 1 
and/or apoptosis in pattern formation. , ' 

Dysregulation of apoptosis can result in' inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control, the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers/lt may 
also act as a morphogen to control ceil and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said. disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue ',V 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in . 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation pf cellular growth. and proliferation. /The tissue distribution 
in immune cells (T-cells, monocytes) indicates polynucleotides and polypeptides v 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders,, Representative uses arc described in the "Immune 
Activity" and "infectious disease" sections below, in Example 1 1, 13, 14, 1 6, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product 
indicates a role in regulating the proliferation; survival; differentiation; and/or 
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activation of hematopoietic cell lineages, including blood stem cells, this gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
. other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). ' . • ' 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for. 

■ immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T<ell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- versns-grafr and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelinatiori, systemic 
lupus erythematosis. drug induced hemolytic anemia] rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought.to be useful in 
the expansion of stern cells and committed progenitors of various blood lineages,' and . 
in the differentiation and/or proliferation of various cell types. The tissue distribution 
in brain indicates polynucleotides and polypeptides corresponding to this gene are 
useful for the detection, treatment, and/or prevention of neurodegenerative disease 

■ slates, behavioral disorders, or inflammatory conditions. Representative uses are 
described in the "Regeneration" and "Hyperprdliferative Disorders" sections below, in 
Example 11, 15, and 18, and elsewhere herein. Briefly, the uses include, but are not 
limited to the detection, treatment, and/or prevention of Alzheimer's Disease. 
Parkinson's, Disease, Huntington's Disease; Tourette Syndrome, meningitis, 
encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, 
hemorrhages, schizophrenia, mania, dementia; paranoia, obsessive compulsive 
disorder, depression, panic disorder, learning disabilities. ALS, psychoses, autism, 
and altered behaviors, including disorders in feeding, sleep patterns, balance, and 
perception. In addition, elevated expression of this gene product in regions of the 
brain indicates it plays if role in normal neural function. 
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Potentially, this gene product is involved in synapse formation, 
neurotransmission. learning, cognition, homeostasis, or neuronal differentiation or 

. survivaJ. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, us well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:44 and may have been publicly available prior to conception .of' 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention! To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2558 of SEQ ID NO:44, b is an : 
integer of 15 to 2572, where both a and b correspond to the positions of nucleotide 

1 residues shown in SEQ ID NO:44, and where b is greater than or equal to a + 14. 

FEATURES OF PROTKIN ENCODED BV GENE NO: 35 

This gene is expressed primarily in adult pulmonary tissue, umbilical vein, 
prostate, and fetal tissue (e.g.,heart). " - , , 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases of the pulmonary system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the pulmonary system, 
expression of this gene at significantly higher or lower levels is routinely detected in . 
certain tissues or cell types (e.g., pulmonary, cancerous and wounded tissues) or 
bodily fluids (e.g.; serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
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standard gene expression level, i.e., the expression level in healthy tissue or bodily J " 
fluid from ah individual not haying the disorder. - ' , 

Preferred polypeptides of trie present invention comprise immunogenic . . 
epitopes shown in SEQ ID NO: 1 58 as residues: Arg-45;to Gly-51, Glu-75 to Asn-81. 
Polynucleotides encoding said polypeptides arc also provided. v r \ 

The tissue distribution in pulmonary tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the detection and treatment of , 
disorders associated with developing lungs, particularly in premature infants where 
the lungs are the last tissues to develop. Additionally, the tissue distribution indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
• diagnosis and intervention of lung tumors, since the gene is involved in the regulation 
of cell division, particularly since it is expressed in fetal tissue. Moreover, the V * 
expression within fetal. tissue and other cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may - - 
showAHilify in the diagnosis, treatment, and/or prevention of developmental diseases 
and disorders, including cancer, and other proliferative conditions. Representative ; . \ 
. uses are described in the "Hyperpi oliferau ve Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell } 
death, as occurs in the development of some cancers, or in failure to control the extent: 
' of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications iu the adult for tissue regeneration and the treatment of cancers. lt may 
also act as a morphogen to control cell and tissue type specification. Therefore, the - 
polynucleotides arid polypeptides of the present invention are useful in treating, 
detecting,\and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, nnd/or preventioriof 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in ; 
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proliferating and cancerous cells and tissues. The protein can also be used. to gain new 
insight into the regulation of cellulargrowth and proliferation. Furthermore, the 
protein may also, be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
. interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such us EST sequences, arc publicly, 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NOi45 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 5 12 of SEQ ID NO:45, b is an 
integer of 1 5 to 526 . where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:45, and where b is greater than or equal to a +14. ^ 

FEATURES OF PROTEIN ENCODED BY GENE NO: 36 

This gene is expressed primarily in adipose tissue: 

Therefore , polynucleotides and polypeptides of the i nvention are useful as 
reagents for differential identification of the tissue(s) or cell type(s)' present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, fat metabolism. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the metabolic system, expression of this gene at 
significantly higher or lower levels is. routinely detected in, certain tissues or cell types 
(e.g. f cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
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the expression level in heaithy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 159 as residues: Pro-96 to Ser-106. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in adipose tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment of obesity and 
other metabolic and endocrine conditions or disorders. Furthermore, the protein : 
product of this gene may show utility in ameliorating conditions which occur 
secondary to aberrant fatty-acid metabolism (e.g. aberrant myelin sheath 
development), either directly or indirectly. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible "through sequence databases" Some of these sequences are 
related to SEQ ID NO:46 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1018 of SEQ ID NO:46, b is an 
integer of 15 to 1032, where both a and b correspond to the positions of nucleotide .■ 
residues shown in SEQ ID NO:46. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

> This gene is expressed primarily in adult brain tissue, testes, placenta, kidney, . 
infant and fetaltissue (e.g., liver, spleen, lung) and to a lesser extent iii immune cells 
(e.g./Trcells and neutrophils) and in cancerous tissues (e.g.,ovarian tumor, Hodgekins 
lymphoma, pancreas. f-ceU). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) orceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
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not limited to, CNS disorders, disorders of the testicles, cancer, particularly ovarian, 
pancreatic, T-ccIL and Hodgekins lymphoma: Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain,CNS , and testes expression of this * 
gene at significandy higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., neural, urogenital, cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory, conditions. 
Representative uses are described in the "Regeneration" and "Hyperprolifcrative 
Disorders", sections below, in Example 1 1, 15, and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tburette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, * 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder; learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

. Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Moreover, the expression within fetal tissue and other cellular sources 
marked by proliferating cells indicates this protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis, treatment, ancVor prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
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and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation arid/or apoptosis in pattern f 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in, the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of . 
potential roles in proliferation and differentiation^ this gene product may have 
applications in the adult for' tissue regeneration and the treatment of cancers. -It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides^as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used lo gain new 
insight into the regulation of cellular growth and proliferation. The tissue distribution ' ' 
indicates polynucleotides and polypeptides corresponding to this gene are useful for 
the diagnosis and treatment of a variety of immune system disorders. Representative 
uses are described in the "Immune Activity' 1 and "infectious disease" sections below, - 
inExample 11, 13, 14, 16/18, 19, 20, and 27, and elsewhere herein. Briefly,the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine . 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunologicaJ disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
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such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
• tissues, such as host -versus -graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, iense tissue injury, demyelination, systemic 
lupus erythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem ceils and committed progenitors of various blood lineages, and , 
, in the differentiation and/or proliferation of various cell types. Additionally, the 
tissue distribution in testes indicates that polynucleotides and polypeptides - 
corresponding to this gene are useful for the treatment and diagnosis of conditions 
concerning proper testicular function (e.g. endocrine function, sperm maturation), as 
well as cancer. Therefore, this gene product is useful in the treatment of male 
infertility and/or. impotence. This gene product is also useful in assays designed to 
identify binding agents, as such agents (antagonists) are useful as male contraceptive 
agents. Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that is expressed, particularly at tow levels, in other tissues of the body. Therefore, 
this gene product is expressed in other specific tissues or organs where it may play 
related functional roles in other processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. 
Furthermore, the protein may also be used to determine biological activity, raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as. antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:47 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are'specifically • 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or ' 
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more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 2666 of SEQ ID NO:47, b is an 
integer of 15 to 2680, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:47, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

When tested against fibroblast cell lines, superriatants removed from cells 
containing this gene activated the EGRJ assay. Thus, it is likely that this gene 
activates fibroblast cells through a signal transduction pathway. Early growth 
response I (EGRI) is a promoter associated with certain genes that induces various 
tissues and cell types upon activation, leading the cells to undergo differentiation and 
proliferation. 

This gene is expressed primarily in endometrial stromal cells: endometrial 
tumors, keratinocytes, fetal tissue (e.g. liver, spleen) and to a lesser extent in 
endothelial cells and immune cells (e.g., T-cells). ' : ' 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identi fication of the tissue(s) or. cell rype(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, endometrial carcinoma and immune cells disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the female 
reproductive system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g.; immune, cancerous arid 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial Quid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in the endometrium indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for treating female infertility. The 
protein product is likely involved in preparation of the endometrium of implantation 
and could be administered either topically or orally. Alternatively, this gene could be. 
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transfected in gene-replacement treat merits into the cells of the endometrium and the 
protein products could be produced Similarly, these treatments could be performed 
during artificial insemination for the purpose of increasing the likelyhood of 
implantation and development of a healthy embryo. In both cases this gene or its gene 
product could be administered at later stages of pregnancy to promote heathy 
development of the endometrium. Additionally , polynucleotides and polypeptides .' 
corresponding to this gene are useful for diagnosis' and treatment of endometrial 
carcinoma. The. tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene arc useful for the diagnosis and treatment of cancer and 
other proliferative disorders. Expression within embryonic tissue and other. cellular 
sources marked by proliferating cells indicates that this protein may play a role in the 
regulation of cellular division. Additionally, the expression in hematopoietic cells and 
tissues indicates that this protein may play a role in the proliferation, differentiation, 
and/or survival of hematopoietic cell lineages. In such an event, this gene is useful in 
the treatment of lymphoproliferative disorders, and in the maintenance and .. 
differentiation of various hematopoietic lineages from early hematopoietic stem and 
committed progenitor cells. Similarly, embryonic development also involves 
decisions involving cell differentiation' and/or apoptosis in pattern formation. Thus 
this protein may also be involved in apoptosis of tissue differentiation and'could again 
be useful in cancer therapy. The tissue distribution in immune cells such as helper T- 
cells indicates polynucleotides and polype pt ides corresponding to this gene are useful 
for the diagnosis and treatment of a variety of immune system disorders. 
Representative uses are described in the "Immune Activity" and "infectious disease" 
sections below, in Example 1 1,13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. '. 
Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; arid/or activation of hematopoietic cell 
lineages/including hlood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such. 
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as AIDS, leukemia, rheumatoid* arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-ceil mediated cytotoxicity; immune reactions; to transplanted organs and 
tissues, such as host- vers us -graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, tense tissue injury, dcmyelination. systemic 
lupus erythernatosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease,' and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types.. The tissue distribution 
in keratinocytes indicates polynucleotides and polypeptides corresponding to this 
gene are useful for the treatment, diagnosis, and/or prevention of various skin 
disorders. Representative uses are described in the "Biological Activity", 
"Hyperproliferative Disorders", "infectious disease", and "Regeneration" sections 
below, in Example II, 19, and 20, and elsewhere herein. Briefly, the protein is useful 
in detecting, treating, and/or preventing congenital disorders (i.e. nevi, moles, 
freckles, Mongolian spots, hemangiomas, port-wine syndrome), integumentary 
tumors (i.e. keratoses. Bo wen's Disease, basal cell carcinoma, squamous cell 
carcinoma, malignant melanoma, Paget's Disease, mycosis fungoides, and Kaposi's 
sarcoma), injuries and inflammation of the skin (i.e. wounds, rashes, prickly heat 
disorder, psoriasis, dermatitis), atherosclerosis, ulicaria, eczema, photosensitivity, 
autoimmune disorders (i.e. lupus erythematosus, vitiligo, dermatomyositis, morphea, 
scleroderma, pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, 
purpura, and xanthelasma. In addition, such disorders may predispose increased 
susceptibility to viral and bacterial infections of the skin (i.e. cold sores, warts, 
chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, erysipelas, 
impetigo, tinea, althlctcs foot, and ringworm). Moreover, the protein product of this 
gene may also be usefuf for the treatment or diagnosis of various connective tissue ' 
disorders (i.e.. arthritis, trauma, tendonitis, chrondomalacia and inflammation, etc.), 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, 
dermatomyositis, etc.). dwarfism, spinal deformation, joint abnormalities, amd 
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. '' "> chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial /. • 

' osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Scfamid). 
10 Furthermore, the protein may also be used to determine biological activity, to raise 

'" ; antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 

5 that modulate their interactions, in addition to its use as a nutritional supplement. 

Protein, as well as, antibodies directed against the protein may show utility as a tumor ' 
marker and/or immunotherapy, targets for the above listed tissues; 1 \' 

Many polynucleotide sequences, such as EST sequences, are publicly • 
available and accessible through sequence databases: Some of these sequences are 
related to SEQ ID NO:48 arid may have been publicly available prior to conception of 
V i the present invention/Preferably, such related rwlynucie^tides are specifically 

excluded from the scope of the present invention, to list every related sequence is 
cumbersome. Accordingly, preferably, excluded from the present invention are one or 
* 25 • ' . • more polynucleotides comprising a nucleotide sequence described by the general 

15 formula of a-b, where a is any integer between 1 to 1716 ofSEQ IDN6:48, b is an : 
integer of 15 to 1730,. where both a arid b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:48, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

20 this gene is expressed primarily in LNCAP cells, (prostate cell line) and retina 

derived N2b5HR cells. '"' • . . V- 

therefore; polynucleoddes and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a . 
. biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, prostate cancer and eye disorders. Similarly, polypeptides and, * 
antibodies directed to. these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of iht above tissues or cells, particularly of the male urogenital and , 
\ reproductive system expression of this gene at significantly higher or lower levels is 

. 30 routinely detected in certain tissues or cell types (e g!, cancerous and wounded 

■tissues), or bodily fluids (e.g;, serum, plasma, urine, rf synovial fluid and spinal fluid) or 
50 1 another tissue or cell sample taken from an individual having such a disorder, relative : 
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to the standard gene expression level, i.e., the expression level in healthy tissue or ' 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 162 as residues: Asn-50 to Ser-57. Polynucleotides ; 
encoding said polypeptides are also provided. 

The expression in prostate may indicate the gene or its products can be used in the 
disorders of the prostate, including inflammatory disorders, such as chronic - 
prostatitis, granulomatous prostatitis and malacopiakia, prostatic hyperplasia and 
prostate neoplastic disorders, including adenocarcinoma, transitional cell carcinomas, 
ductal carcinomas, squamous cell carcinomas, or as hormones or factors with 
systemic or reproductive functions. The tissue distribution in retina indicates that - 
polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment and/or detection of eye disorders including blindness, color blindness, 
impaired vision, short and long sightedness, retinitis pigmentosa, retinitis proliferans, 
and retinoblastoma, retinochoroiditis, retinopathy and retinoschisis. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor markerand/or 
immunotherapy targets for the above listed tissues. - 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these' sequences arc 
related to SEQ ID NO:49 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome; Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 26 1 of SEQ ID NO:49, b is an 
integer of 15 to 1275, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:49, and where b is greater than or equal to a + 14. . 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: RCCCRGCSCRARLCPPARSTAVAPECRGAHPSR 
AMRPGTALQAVLLAVLLVGLRAATGRLLSGQPVGRGGTQRPCYKVIYFHD 
TSRRLNFEEAKEACRRGWRPASQHR VLKMNRN (SEQ ID NO: 3 15)- 
Polynucleotides encoding these polypeptides are also provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MRPGTALQAVLLAVLLVGLRAATGRLLSGQPVCRGG 
TQRPCYKVIYFHDTSRRLNFEEAKEACRRGWRPASQHR VLKMNRN (SEQ ID 
NO: 316). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in smooth muscle and human thyroid and to a 
lesser cxLent in amniotic cells and human endometrial stromal cells- treated with 
progesterone. ' 4 

Therefore, polynucleotides and polypeptides of the in vend on are useful as 
. reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, thyroid disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the endocrine system, expression of this gene at * 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 163 as residues: Ser-75 to Leu-8 1. Polynucleotides 
encoding said polypeptides are also provided. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of endocrine 
disorders of the thyroid. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO:50 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 1748 of SEQ ID NO:50 T b is an 
integer of 15 to 1 762, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 50, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene is expressed primarily in human testes tumor and bone marrow. 
- Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type (s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the testicles including but not limited to testicular * 
cancerand immune system disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 

' identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the male reproductive system and immune system 
expression of this gene at significantly higher or lower levels is routinely detected in 

, certain tissues or cell types (e.g., reproductive, immune, cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) of 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. . . . , 
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Preferred polypeptides of the present invention comprise immunogenic. 
■ epitopes shown in SEQ [D NO: 164 as residues: His-3 1 to G|y-41. Polynucleotides 
encoding said polypeptides are also provided. '- ' 

The. tissue distribution in testes, particularly testicular tumors, indicates that 
5 polynuc'eotices and polypeptides corresponding to this gene arc useful for the * * : 
treatment and diagnosis of conditions concerning proper testicular function (e.g. 
, endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This'gcnc product is . ' 
also useful in assays designed to identify binding agents, as such agents (antagonists) 
10 are useful as male contraceptive agents. Similarly, the protein is believed to be useful 
in the treatment and/or diagnosis of testicular cancer. The testes are also a site of > 
active gene expression of transcripts that is expressed, particularly at low levels, in 
other tissues of the body. Therefore, this'gene.product is expressedin other specific 
tissues or organs where it may play related functional roles in other processes, such as 
i 5 hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. The tissue distribution in bone marrow indicates 
polynucleotides and polypeptide! corresponding to this gene arc useful for the 
diagnosis and treatment of a variety of immune system disorders. Representative uses ' 
are described in the "Immune Activity" and "infectious disease' 1 sections below, in 
0 Example I K 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including ; . 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
5 treatment of cancer, (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou s Disease, inflammatory 
) bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-vcrsus-graft and grafi-versus-host diseases, or autoimmunity ' " 
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.disorders, such as autoimmune infertility, lense tissue injury, demyciination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents thai modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a rumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:51 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 2045 of SEQ ID NO:51 , b is an 
integer of 15 to 2059, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:51, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

The translation product of this gene shares sequence homology with 
proiocadherins, which are related to cadh'erin, and possess-cell adhesive ability. 
Cadherins are glycosylated integral membrane proteins that are involved in cell -cell 
adhesion. ' > .. • 

This gene is expressed primarily in brain (infant, adult frontal lobe, manic 
depression tissue) and to a lesser extent in epididymus, healing groin wounds, ovary, 
adipocytes, and fetal tissue (e.g., kidney and retina). 
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. \. • -nKreforevpolyoutleotides and polypeptides of the mvenUon are useful as' ' 

reagents for differential idem.fica.ion of , he tissue(s) or cell type(s) present in a . ' 
10 " bi< "° 8ica ' Sam P' e ^ for *WOsk of diseases and conditions which include, butare 

. not limited to. neurodegenerative disorders, impaired male and female fertility, 

anybodies directed to these polypeptides are useful in providing immunological 
probe, for differential identification of ,„e «:ssue( S ) or cell type( S ). For a number of 
disorders of the above .issues orcells, particularly of the nervous system and 
reproductive system expression of this gene at significantly higher or lower levels is 
routine* de.ec.ed in certain tissues or cell , ypes (e.g.; neural, reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and 
spmal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual riot having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 165 as residues: Val-35 to Lys-41, Ser-68 to G.n-73 ■ 
GIu-88 to Ciu.93. Arg-156 to Gly-163, Ala-199 to Gly-206. Asp-216 to Ser-226 Thr- 
249 to Asn-254. A f ?-339 to Pro-345. Ile-370toGly-379. Pro-429 to Glu-434 Arg- 
46. .oPro-466.Ala-475, 0 Thr^2,P ro .585.oG.>-593.0.u.631 to GIn-639 Pro- ' ' 
674 to Pro-682. G.n-7.5 toGly-720. Ser-736 to Arg-742. Polynucleotides encoding .' 
35 said polypeptides ate also provided. , 1 

BLAST analysis reveals high homology to protocadherin sequences. These - 
sequences are related . ocadherin, and possess cell adhesive ability. Such proteins 
may have regulatory functions in .he cell, as well as the cell-cell adhesive properties 
Armbodies produced against these sequences are useful for modulating the binding 
acuvity of these protocadherins, and can be used therapeutically. The tissue 
distribution in brain indicates polynucleotides and polypeptides corresponding to this ' 
gene are useful for the detection, treatment, and/or prevention of neurodegenerative • 
disease states, behavioral disorders, or inflammatory conditions. Representative uses • , 
are described in the "Regeneration" and "Hypcrproliferauve Disorders" sections *. 
Wow. in&anple 1.1, tS, t»d 18. and elsewhere herein. Briefly, the uses ihciude but 
■54 • « n« Urnted «o «*, detecUon. treatment, wd/or prevention of Aizheirne* Dise«e 
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.* Parkinson's Disease, Huntington's Disease. Tourette Syndrome, meningitis, 
encephalitis, demyeiinating diseases, peripheral neuropathies, neoplasia, trauma, 
10 '■' congenital malformations, spinal cord injuries ischemia and infarction/aneurysms, 

hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 
5 disorder, depression, panic disorder, learning disabilities, ALS, psychoses, autism, 
and altered behaviors, including disorders in feeding, sleep patterns, balance, and '■. 
perception. In addition, elevated expression of this gene product in regions of the 
brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in epididymus indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
25 , maturation), as well as cancer. Therefore, this gene product is useful in the treatment 

15 of male infertility arid/or impotence. This gene product is also useful in assays 
' . > designed to.identify binding agents, as -such agents (antagonists) are useful as male ' 

: contraceptive agents. Similarly, the protein is believed to be useful in the treatment 

30 • , and/or diagnosis of testicular cancer. The testes are also a site of active gene 

' expression of transcripts that is expressed, particularly at low levels, in other. tissues 

20 of the body! Therefore, this gene product is expressed in other specific tissues or 
35 organs where it may play related functional roles in other processes, such as 

hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. Moreover, the expression within fetal tissue (e.g., kidney 
and retina) and other cellular sources marked by proliferating cells indicates this 
protein may play a role in the regulation of cellular division, and may show utility in ' 
the diagnosis, treatment, and/or prevention of developmental diseases and disorders, 
including blindness, cancer, and other proiifcrauvc conations. Representative uses 
are described in the "HyperproJiferatiyc Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues reiy on decisions 
30 involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulatioh of apoptosis can result in inappropriate suppression of cell 
50 death, as occurs in the development of sorne cancers, or in failure to control the extent 



40 25 



45 



WO 00/06698 



114 



PCT/US99/PU0 



of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of ' 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult tor tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control. cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, anuVor preventing said disorders and conditions: in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
\ differentiation and is useful in the detection, treatment, and/or prevention of ' 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the . 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
inarjtei-s, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as KST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:52 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide .sequence described by the general 
formula of a-b. where a is any integer between 1 to 3268 of SEQ ID NO:52,.b is an 
integer of 15 to 3282. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:52, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 
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IRHEQQGEEDDEHARPLAESLLLAi ADLLFCPDFIVQSHRRSTVDS AEDVHSL 
DSCEYIWEAGVGFAHSPQPNY1HDMNRMELLKLIXTCFSEAMYIPPAPESGS ' 
TNPWVQFFCSTENRHALPLFTSLLNTVCAYDPVGYGIPYNHLLFSDYREPLVE 
EAAQVLIVTLDHDSASSASPTVDGTTTGTAMDDADPPGPENLFVNYLSRIHRE 
EDFQFILKGIARLLSNPLLQTYLPNSTKropVPPGAASSliEALRLQQEIPLLRA 
EEQRRPRI IPCPHPLLPQRCPGRS V (SEQ ID NO: 317). Polynucleotides encoding 
such polypeptides are also provided. , 

. This gene is expressed primarily in brain, breast, breast cancer tissue and to a . 
' lesser extent in cpididymus, amniotic cells, and embryo tissue. 

Therefore, polynucleotides and polypeptides of the in vention are useful as 
reagents fordifrerenual identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and condition!; which include, but are 
not limited to, neurodegenerative disorders, impaired CNS -function, male sterility, 
and breast cancer. Similarly; polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell typc(s). For a number of disorders of the above 
tissues or cells, particularly of the nervous and reproductive systems, expression of 
this gene at significantly higher or lower levels is routinely detected in certain tissues 
or cell types (e.g.. neural, male reproductive, cancerous and wounded tissues) or 
bodily fluids (e.g., amniotic, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 166 as residues: Pro-22 to Pro-3 1 , Ser-38 to His-43, 
■ Asp-74 to Leu-79, Asp-1 13 to Glu-121, Leu-157 to Val-166, Ala-189 to Arg-196, 
Gin-206 to Arg-2 1 1 . Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in brain, particulary in the cerebellum, indicates 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders; or inflammatory conditions. Representative uses are described in the 6 
"Regeneration" and "Hyperproliferati ve Disorders" sections below 1 , in Example 1 1 , 
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15. 'and 18, and elsewhere herein. Briefly, (he uses include, but arc not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyeiinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
5 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 

mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses; autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a roie in 
10 normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival.. The tissue distribution in epididymus indicates that polynucleotides and 
25 polypeptides corresponding to this gene are useful for the treatment and diagnosis of 

15 conditions concerning proper testicular function (e.g. endocrine function, sperm 

maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
20 and/or diagnosis of testicular cancer. The testes are also a site of active gene 
35 expression of transcripts that is expressed, particularly at low levels, in other tissues 

, of the body. Therefore, this gene product is expressed in other specific tissues or 
organs where it may play related functional roles in other processes, such as 
hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
25 possible target indications. The expression in the breast tissue may indicate its uses in 
the diagnosis and/or treatment of breast neoplasia and breast cancers, such as 
fibroadenoma, pipillary carcinoma, ductal carcinoma, Puget's Disease, medullary 
carcinoma, mucinous carcinoma, tubular carcinoma, secretory carcinoma and 
apocrine carcinoma, as well as juvenile hypertrophy and gynecomastia, mastitis and 
30 abscess, duct ectasia, fat necrosis and fibrocystic diseases. Moreover, the expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
50 indicates this protein may play a role in the regulation of cellular division, and may * 
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show utility in the diagnosis, treatment, and/or prevention of developmental diseases 
and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "Hyperproiiferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulatiort of apoptosis can result in inappropriate suppression of ceil 
.death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain . 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation arid. differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
; markers; to isolate cognate Ugands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies, directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. . 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ fD NO: 53 and may have been publicly available prior to conception of , 
the present invention. Preferably, such related polynucleotides are' specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from theprcsent invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general . 
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. formula of. a-b, where a is any integer between 1 to 1 846 of SEQ ID NO:53, b is an 
integer of 15 to 1 860, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:53, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO; 44 

Contact of cells will] supernatant expressing the product of this gene increases 
.the permeability of monocytes to calcium.. Thus,* it is likely that the product of this 
gene is involved in a signal transduction pathway that is initiated when the product of 
this gene binds a receptor on the surface of the monocyte cell. Thus, polynucleotides 
.and polypeptides have uses which include, but arc not limited to, activating monocyte 
cells/ ; " V. , ' ' 

This gene is expressed primarily in CD34 positive cells derived from human 
cord blood. ; v '■ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hematopoietic disorders; immune dysfunction; defects in hematopoietic 
stem and progenitor cells; susceptibility to chemotherapy and irradiation. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e;g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or f 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. ' < .' 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 167 as residues: Ala-38 to Leu-59, Ala*3 to Thr-71, 
Lys-82;to Leu-91, Glu-97 to Ser-U)7, Guv-143 to Ala-149. lie- 1-53 to Leu-158, Ser- 
169 to Arg-182. Polynucleotides encoding said polypeptides are also' provided. 
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.♦ , Elevated expression of this gene product in CD34 positive hematopoietic ceils 

indicates that it is expressed by early stem and progenitor cells of the hematopoietic 
; lineages. Therefore; this may represent a soluble factor that is able to control the 
, survival, proliferation, differentiation, or activation of all hematopoietic lineages, 
incmding stem and progenitor ceils. Thus, it could be quite useful,' for example, in ex 
vivo expansion of stem cell numbers for hematopoietic disorders or for cancer 
patients. Alternately, it may represent a factor that influences the hematopoietic 
microenvironment by affecting stromal cells that release other factors required for 
hematopoietic development. Additionally, the tissue distribution in CD34 positive 
cells also indicates polynucleotides and polypeptides corresponding to this gene are . 
useful for the treatment and diagnosis of hematopoietic related disorders such as 
anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells 
are important in the production of cells of hematopoietic lineages. Representative uses 
are described in the "Immune Activity;' and "infectious disease" sections below, in 
Example 1 1, 13, 14, 16. 18, 19, 20, and 27, and elsewhere herein. Briefly, the uses 
include bone marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. 

, The gene product may also be involved in lymphopoiesis; therefore, it can be 
used in irnrnune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the 
differentiation and/or proliferation of various cell types. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:54 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 756 of SEQ ID NO:54, b is an 
integer of 15 to 770, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:54, and where b is greater than or equal to a +44. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

This gene is expressed primarily in breast and U-vJeek old human embryos 
' and to a lesser extent in stomach pancer and liver. ...... 

' -Therefore, ^iynucleotides and polypq^ides of the ihvenaon ae useful' 'as 

. reagents for differential identification of the tissues) or cell rype( S ) p res e n t in a 
biological sample and tor diagnosis of diseases and conditions which include but are 
no, limited to, breast cancer: stomach cancer, embryonic defects; hepatic disorders 
Similarly, po.ypeptides ard antibodies directed to these polypeptides are useful in 
• ProWding iramun °'og^ Probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
digestive and endocrine systems, expression of this gene at significantly higher or ■ 
lower levels is routinely detected in certain' tissues orcell <ypes ( e.g.. cancerous and 
wounded tissue,) or bodily fluids (eg., serum, plasma, urine, synovia. f| uid ^ spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder 
relative to the standard gene expression level, i.e., the expression ievel in heaithy 
tissue or bodily fluid from an individual not having the disorder. ' 

The tissue distribution indicates that the protein products of this gene are 
useful for the diagnosis and/or treatment of a variety of disorders. Elevated expression 
of th,s gene product in stomach cancer indicates it is useful as a marker or therapeutic ' 
targe, for stomach cancer. Alternately, expression in breasftissue is influenced by the . 
presence or absence of breast cancer tissue, and may thus also serve as a diagnostic 
marker for this cancer as.well. Expression in the developing embiyo may correlate • 
with the normal development of human embryhos; and expression in the liver is 
involved in the regulation of normal liver function and/or liver regeneration. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases Some of these sequences are - 
related to SEQ ID NO:55 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically . 
excluded from the scope of the present invention. To Us, every related sequence ,s ' 
cumbersome. Accord.ng.y, preferably excluded from the present invention are one or ' 
more polynucleotides comprising a nucleotide sequence described by the genera. 
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formula of a-b : where a is any integer between 1 to 1079 of SEQ ID NO:55, b is an 
^ integer of 15 to 1093 . where both a and b correspond to the positions of nucleotide * 
residues shown in SEQ '.ID NO: 55, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

This gene is expressed primarily in human hypothalamus derived from a 
patient with schizophrenia. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. schizophrenia; neurological disorders; impaired nervous system 
function. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or eel! type(s) . For a number of disorders of the above tissues or cells, 
particularly of the nervous system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., neural, ' • 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having . 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from ari individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 169 as residues: Glu-34 to Trp-39. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in brain, particularly in the hypothalamus, indicates 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, anoVor prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions, Representative, uses are described in the 
"Regeneration" and "Hyperproliferative Disorders" sections below, in. Example 1 1, 
15, and 18, and elsewhere herein. Briefly, the uses included but arc not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyei mating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
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cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception: In addition, elevated 
5 expression of this gene product mregions of the brain indicates it. plays a role in 
normal neural. function. "'. 

Potentially, this gene product is involved in synapse formation, ' , . 
neurotransmission, learning, cognition, homeostasis, or neuronal Differentiation or 
survival. Furthermore, the protein may' also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to .* 
identify agents that modulate their interactipns,!in addition to its use as a nutritional 
supplement. Protein, as well as. antiboiJies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, axe publicly 
available and accessible through sequence databases.' Some of these sequences are 
related to SEQ ID NO:56 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention: To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide. sequence described by the general 
formula of a-b, where a is any integer between 1 to 618 of SEQ ID N 0:56, b is an . 
integer of 1 5 to 632, where both a and b correspond to the positions of nucleotide *■ 
residues shown in SEQ ID NO:56. and where b is greater than or equal to' a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

. - ' The translation product of this gene shares sequence homology with human 

lecithin-cholesterol acyltrafisferase (LCAT), which catalyses the transfer of fatty acid 

from the sn-2 position of lecithin to the free hydroxyl group of cholesterol. Preferred 

polypeptides of the invention comprise the following amino acid sequence: RLVYN 

KTSRATQFPDGVDVRVPGFGKTFSLEFLDPSKSSyGSYFHTMVESLVGWGYT 

RGEDVRGAPYDWRRAPNENGPYFLALREMIEiEMYQLYGGPVVLVAHSIVlGN 
MYTLYFLQRQPQAWKDKYIRAFVSLGA 
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PLKJREQQRSAVSTSWLLPYNYTWSPEKVF^ 

DGWLMRQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFG 
DGDGTVNLKSALQCQAWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRV 
LLGP (SEQ ID NO: 3 1 8). Polynucleotides encoding such polypeptides are also 
provided. 

This gene is expressed primarily in osteoblasts & dendritic cells and to a lesser 
extent in muscle and other hematopoietic cell lineages. . * 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell iype(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hematopoietic disorders; immune dysfunction; osteoporosis; 
osteopetrosis; muscle degeneration. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the skeletal and immune systems, expression of this 
'gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO; 170 as residues: Cys-65 to Ser-7 1. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution and homology to letimin-cholesterolacyltransierase 
(LCAT) indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and/or treatment of a variety of disorders. For example, 
artheroscelerosis is a pathological condition of mammals characterised by the 
accumulation of cholesterol in the arteries, which leads to heart disease, strokes, heart 
attacks and peripheral vascular disease. The enzyme could be used in a novel method 
of treating atherosclerosis, which involves increasing the level of LCAT activity, 
which then causes a decrease in the accumulation of cholesterol. The method and the 
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products can be used for the prophylaxis and treatment of atherosclerosis, and 
associated heart disease, myocardial infarction, stroke and peripheral vascular disease, 
as we)l as individuals suffering from Fish Eye Syndrome (caused by LCAT 
deficiency) or Classic LCAT Deficiency Syndrome. Alternately, elevated. expression 
of this gene product in osteoblasts and hematopoietic cell lineages indicates that it 
may play additional roles in bone turnover, regulation of immune system function, 
and muscular function. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:57 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded ^ from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2673 of SEQ ID NO:57, b is an 
integer of 15 to 2687, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:57, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

When tested against HELA epithelial cell lines, supcrnatants removed from 
cells containing this gene activated the GAS assay. Thus, it is likely that this gene 
activates epitheliaJ cells through the Jak-STAT signal transduction pathway. The 
gamma activating sequence (GAS) is a promoter element found upstream of many 
genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a 
large, signal transduction pathway involved in the differentiation and proliferation of 
cells. Therefore, activation of the Jak-STAT pathway, reflected by the binding of the 
GAS element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. A 

This gene is expressed primarily in aduit brain, infant brain, fibroblasts, ■ ' " . ' 

embryonic and fetal tissue (e.g., spleen. liver), placenta and to a lesser extent in 

■ * > ■" ■ ■ 

endocrine organs, cancerous colon and breast. ' . ' t , 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and tor diagnosis of diseases and conditions which include, but are 
not limited to. dementia, epilepsy, schizophrenia, and developmental abnormalities. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s ). For a number of disorders of the above tissues or cells, particularly of the 
neural system, endocrine system, and during development, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g:, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of ' 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative - 
Disorders" sections below, in Example 11, 15, and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection; treatment, and/or prevention of ' 
Alzheimer's Disease. Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding; sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, teaming, cognition, homeostasis, or neuronal differentiation or 
survival; In addition, the expression of this gene product in synovium (synovial 
sarcoma) would suggest a role in the detection and treatment of disorders and 
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conditions afflicting the skeletal system, in particular osteoporosis, bone cancer, 
connective tissue disorders (e:g. arthritis, trauma, tendonitis, chfondomalacia and 
inflammation). The protein is also useful in the diagnosis or treatment of various 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, and . 
dcrmatomyositis). dwarfism, spinal deformation, joint abnormalities, and 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogenesis type IL metaphyseal chondrodysplasia type Schmid, 
etc.). The tissue distribution in endocrine tissues indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of various endocrine disorders and cancers. Representative uses are 
described in the "Biological Activity", "Hyperproliferative Disorders", and "Binding ' 
Activity" sections below, in Example 1 1, 17, 18, 19. 20 and 27, and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of 
Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of the 
pancrease (e.g.- diabetes rnellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper- 
, hypoparathyroidism) , hypothallamus, and testes. Additionally, the expression within 
fetal tissue, cancerous colon and breast, and other cellular sources marked by 
proliferating cells indicates this protein may play a role in the regulation of cellular 
division, and may show utility, in the diagnosis, treatment, and/or prevention of 
developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. , 

; Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent" 
of cell death, as is believed to occur in acquired immunodeficiency and certain - 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product-may have 
applications in the adult for tissue regeneration and the treatment of cancers; It may 
also act as a morphogeri to control cell and tissue type specification. Therefore, the 
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polynucleotides and polypeptides of the present invention are uscful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the. ' 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands of receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 5 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
-available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:58 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general ^ 
formula of a-b, where a is any integer between 1 to 605 of SEQ ID NO: 58, b is an 
integer of 15 to 619, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 5 8, arid where b is greater than or equal to a -i- 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

Preferred polypeptides of the invention comprise the following amino acid 
sequence or a subfragment thereof: MNKEDKVWNDCKGVNKLTNLEEQYIILIFQ 
NGLDPPANMVreSIINEIGIKNNISNFFAKIPFEEANGRLVACTRTYEESIKGSC 
GQKENKIKTVSFESKIQLRSKQEFQFFDEEEETGENHT1FIGPVEKLIVYPPPPA ! 
KGGISVTNEbLHCLNEGEr^NDVlIDFYLKYLVLEKLKKEDADRmiFSSFFYK < 
RLNQRERRNHETTNLSIQQKRHGRVK 

AVVCrYGLEKPKYEPlsTPHYHENAVIQKCSTVEDSaSSSASE^SCSQNSSAK 
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NKGESTSQK.S (SEQ ID NO: 319). Polynucleotides encoding such polypeptides are 
also provided. 

This gene is expressed primarily in fetal tissue, stomach, brain, endometrial 
ceils, and bone and to a lesser extent in prostate, retina, adipocytes, smooth muscle, 
and tumors of the endometrium, ovaries, and parathyroid. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. disorders of the endocrine system, ulcers, stomach cancer, epilepsy, 
schizophrenia, dementia, bone growth, developmental disorders arid resorption. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
digestive system and neural systems expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., neural, " * 
endocrine system, cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual haying such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. ■ 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or iniflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 11, 15, and 18, and elsewhere herein. Briefly, 
the uses include; but are hot limited to the detection, treatment; and/or prevention of 
Alzheimer's Disease. Parkinson's Disease, Huntington's Disease. Toureite Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma; congenital malformations, spinal cord injuries, ischemia and infarction, * 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
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psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 1 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, ur neuronal differentiation or 
survival. Expression of this gene product in stomach tissue indicates involvement in 
digestion, processing, and elimination of food, as well as a potential role for this gene 
as a diagnostic marker or causative agent in the development of stomach cancer, and 
cancer in general. .The expression within embryonic, fetal tissue and other cellular 

, sources marked by proliferating cells indicates this protein may play a role in the 
regulation of cellular division, and may show Utility in the diagnosis, treatment, 
and/or prevention of developmental diseases and disorders, including cancer, and 

* other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 
and/or apoptosis in pattern formation. 

Dysregulaiion of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers; or in failure lo control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 

' neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the - 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative. conditions. Thus this protein may modulate apoptosis or tissue ' 
differentiation and is useful in- the detection/treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues 1 . The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. The tissue distribution 
in parathyroid rumor indicates polynucleotides and polypeptides corresponding to this 
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gene are useful for the detection, treatment, and/or prevention of .various endocrine 
disorders and cancers/ Representative uses are described in the' "Biological Activity", 
"Hyperproliferative Disorders", and "Binding Activity" 'sections below, in Example . 
1 L 17, 18, 19. 20 and 21, and elsewhere herein. Briefly, the protein can be used for . 
the detection, treatment, and/or prevention of Addison's Disease, Cushing's ' 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus). 
adrenal cortex, ovaries, pituitary (e.g.. hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) ; hypothalamus, and 
testes. The tissue distribution in testes indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays , ' 
designed to identify binding agents, as such agents (antagonists) arc useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene 
expression of transcripts that is expressed, particularly at low levels, in other tissues 
of the body. Furthermore, the protein may also be used to determine biological 
activity, raise anybodies, as tissue markers, to isolate cognate ligands or receptors, to' 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

■ . Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:59 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope.of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 13<54 of SEQ ID NO:59, b is an 
integer of 1 5 to 1 378, where botH a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:59, and where b is greater than or equal to a + 14. 
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The translation product of this gene shares good protein homology with 
. Xenopus NaDC-2 gene and a rabbit renal sodium/dicarboxylate cotransporter. ' The 
translation product of this gene also shares good homology " with a rat placental protein 
which is a sodium-coupled high affmity dicarboxyiate transporter. Therefore, it is 
likely that that the translated product encoded by this gene shares similar biological 
activity. 

This gene is expressed primarily in the placenta and colon adenocarcinoma^ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental abnormalities as well as failure to thrive anomalies. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s).,For a number of disorders of the above tissues or cells, particularly of the 
female reproductive system and colon, expression of this gene at significantly higher 
or lower levels is routinely detected in certain tissues Or cell types (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., amniotic, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from ari individual having such 
a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic * 
epitopes shown in SEQID NO: 173 as residues : Lys- 166 to Giy- 181. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in human placenta and the shared homology of this 
translation product to a rat placental protein indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and/or treatment 
of disorders of the placenta. Specific expression within the placenta indicates that this 
gene product may play a role in the proper establishment and maintenance of 
placental function. Alternately, this gene product is produced by the placenta and then 
transported to the embryo, where it may play a crucial role in the development and/or 
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; survival of the developing embryo or fetus. Expression of this gene' product in a 
vascular-rich tissue such as the placenta also indicates that this gene product is 
produced more generally in endothelial ceils or within the circulation. In such 
instances, it may play more generalized rolesin vascular function^ such as in 
angiogenesis.lt may also be produced in the vasculature and have effects on other 
cells within the circulation, such as hematopoietic cells. It may serve to promote the 
proliferation, survival, activation, and/or differentiation of hematopoietic cells, as well 
as other ceils throughout the body! The tissue distribution in colon tissue indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and/or treatment of disorders involving the colon. Expression of this gene 
product in colon tissue indicates involvement in digestion, processing, and 
elimination of food: as well as a potential role for this gene as a diagnostic.marker or 
causative agent in the development of colon cancer, and cancer in general. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO: 60 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome: Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to.l 1 12 of SEQ ID NO:60,! b is an 
integer of 15 to 1 1 26, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:60, and where b is greater than or equal to a -+■' 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

This gene is expressed primarily in the spinal cord. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) i or ceil type(s) present in a - 
biological sample and .for diagnosis of diseases and conditions which include, but are 
not limited to. paralysis, neurologic disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides arc useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s).' For a number of disorders of 
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the above tissues or cells, particularly of the nervous system, expression of this gerie 
at significantly higher or lower levels is routinely detected in cenain tissues or cell 
types (e.g., neural, cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
. from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual riot 
having the disorder. , 

The tissue distribution in spinal cord indicates poly nucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
: prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
"Hyperproliferative Disorders" sections below, in Example 1 1, 15^ and 18, and 
elsewhere herein. Briefly, the uses include, but are not limited to the detection, 
treatment, and/or prevention of Alzheimer's Disease,*Parkinson's Disease, 
Huntington's Disease. Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities. ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
■■' expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. • 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological' activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional ; 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases.' Some of these sequences are 
related to SEQ ID NO:61 aud may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present Invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
- formula of a-b, where u is any integer between 1 to 2064 of SEQ ID NO:61, b is an 
integer of 1 5 to 2078, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:6l, and where b is greater than or equal to a +' 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

This gene is expressed primarily in keratinocytes, brain, fetal tissues, 
pericardium, stomach, and cancerous tissues (e.g., stomach, adrenals, parathyroid, 
germ cell, colon, breast): ...... 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a ' 
biological sample and for diagnosis of diseases and conditions which include/but are 
not limited to, skin disorders, neurodegenerative and developmental disorders, heart 
disease, and cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the cardiovascular and gastrointestinal systems; 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., neural, immune, cancerous and wounded tissues) or 
bodily fluids (e.g;, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 11,15. and 18, and elsewhere herein. Briefly, 
the uses include, but arc not limited to the detection, treatment, and/or prevention of 
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Alzheimer's Disease. Parkinson's Disease, Huntington's Disease, Tourettc Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, •'. 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction. ; 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep • 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission , learning , cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in keratinocytes indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment, diagnosis, and/or 
prevention of various skin disorders: Representative uses are described in the 
''Biological Activity", "Hyperproliferative Disorders", "infectious disease*, and 
"Regeneration" sections below, in Example 11. 19, and 20, and elsewhere herein. 
Briefly, the protein is useful in detecting, treating, and/or preventing congenital " '" 
disorders (i.e. ncvi, moles; freckles, Mongolian spots, hemangiomas, port-wine 
syndrome), integumentary tumors (tie. keratoses, Bowen s Disease, basal cell V 
carcinoma, squamous cell carcinoma; malignant melanoma, Paget's Disease, mycosis 
fungoides, and Kaposi's sarcoma), injuries and inflarnmation of the skin (i.e. wounds, - 
rashes, prickly heat disorder" psoriasis, dermatitis),' atherosclerosis, uticaria, eczema, 
" photosensitivity, autoimmune disorders (i.e. lupus erythematosus; vitiligo, 
dennatomyositis, morphea, scleroderma, pemphigoid, arid pemphigus), keloids, striae, 
erythema, petechiae, purpura, and xanthelasma. In addition, such disorders may 
predispose increased susceptibility to viral arid bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molJuscum contagiosum, herpes zoster, boils, cellulitis, 
erysipelas, impetigo, tinea, althletes foot, and ringworm); Moreover, the protein '■ 
product of this gene may also' be useful for the treatment or diagnosis of various 
connective tissue ' di sorders (i.e. , arthritis;, trauma, tendonitis, chf ondomalacia and 
inflammation, etc.), autoimmune disorders (i.e.. rheumatoid arthritis, lupus, 
scleroderma, derniatpmyositis, etc.), dwarfism, spinal deformation joint 
abnormalities, amd chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, 



WO 00/06698 



PCT/US99/17130 



< familial osteoarthritis, Atelosteogenesis type ll y metaphyseal chondrodysplasia type 
Schmid). The expression within fetal tissue (e.g.,' spleen and liver) and other cellular 
sources marked by proliferating cells indicates this protein may play a role in the 
regulation of cellular division, and may show utility in the diagnosis, treatment, 
and/or prevention of developmental diseases and disorders, including cancer, and 
other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving ceil differentiation 
and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
. of ceil death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SM A). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult tor tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the ' 
polynucleotides and polypeptides of the present invention are -useful in treating, 
detecting;, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and.diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous ceils and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Additionally, the 
tissue distribution in the pericardium of the heart indicates that the protein is useful in 
the detection, treatment, and/or prevention of a variety of vascular disorders and 
conditions, which include, but are not limited to miscrovascular disease, vascular leak "' 
syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery' disease; 
arteriosclerosis, and/or atherosclerosis: Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isoiatecognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
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protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 62 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome/Accordingly, preferably excluded from the present invention are one or 
. more polynucleotides comprising a nucleotide sequence described by the general ■ 
formula of a-b; where a is any integer between 1 to 748 of SEQ ID NO:62, b is an 
integer of 15 to 762, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:62, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

This gene is expressed primarily in the brain and in cartilage and to a lesser 
extent in the retina, activated T-cells, pineal gland, the lungs, and in synovial 
sarcoma. , , 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 1 
not limited to, neurological diseases, such as epilepsy and dementia, osteoarthritis, 
retinopathies, hematopoietic diseases, emphysema, and lurig cancer . Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the neurologic 
system, cartilage and musculature, vision, the hematopoietic system, and the 
pulmonary system expression of this gene at significantly higher orjower levels is 
routinely detected in certain tissues or cell types (e.g., neural, immune, cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. ' ' ' • 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 176 as residues: Arg-34 to Cys-44. Polynucleotides ' 
encoding said polypeptides are also provided. 

1 he tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory condidons. , 
Representative uses are described in the "Regeneration" and "Hyperproliferatiye 
, Disorders" sections below, in Example 1 1 , 15, and 18, and elsewhere herein. Briefly, 
' the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome/ 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, AJLS. 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function: r 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in T-cells indicates poly nucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatmeni of a variety of 
immune system disorders. Representative uses are described in the "Immune 
Activity'* and "infectious disease" sections below, in Example 1 1 , 13, 14, 16, 18, 19, 
20, aiid 27, and elsewhere herein. Briefly, the expression of this gene product 
indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production^ antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). . ... . 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
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.as AIDS, leukemia, rheumatoid-arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-celJ mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demy eiinat ion, systemic * 
lupus crythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma.' Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells. and committed. progenitors of various blood lineages, and 

, in the differentiation and/or proliferation of various cell types. The expression of this 
gene product in synovium would suggest a role in the detection and treatment of 

* disorders and conditions afflicting the skeletal system, in particular osteoporosis, bone 
cancer, connective tissue disorders (e.g. arthritis., trauma, tendonitis, chrondomalacia 
and inflammation). The protein is also useful in the diagnosis or treatment of various 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, and 
dermatomyositis), dwarfism, spinal deformation, joint abnormalities, and 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogeriesis type II , metaphyseal chondrodysplasia type Schmid, 
etc.). Additionally, the expression within fetal tissue and other cellular sources 
marked by proliferating cells indicates this protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis, treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. 

' Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the.extent 
of ceil death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 



WO 00/066*8 PCT/US99/17130 

■ ' 140 ' ' . •• 

applications in the adult for tissue regeneration and the treatment of cancers. It may 
- also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue . . ' • . 
differentiation and is useful in the detection, treatment, and/or prevention of 
■ degenerative or proliferative conditions arid diseases. The protein is useful in * " 

modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous ceils and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity; to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. ; 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:63 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-h, where a is any integer between I to 1080 of SEQ ID NO:63, b is an 
integer of 15 to 1094, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:63, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

This gene is expressed primarily in umbilical vein endothelial cells induced by 

il-4. ' . • ' ' ;.. 

Therefore, polynucleotides and polypeptides of the invention axe useful as 
reagents for differential identification of the tissue(s) or'ceil type(s) present in a ' * 
biological sample and for diagnosis of diseases and conditions which include, but are 
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not limited to, angiogenesis. inflammatory disorders, hematopoietic disease. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
angiogenic and hematopoietic systems, expression^ of this gene at significantly higher 
or lower levels is routinely detected in certain tissuesor cell types (e.g., cancerous and 
, wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in endothelial cells indicates polynucleotides and L . 
polypeptides corresponding to this gene are useful in the detection, treatment, and/or 
prevention of vascular conditions, which include, but are not limited to, microvascular, 
disease, vascular leak syndrome, aneurysm, stroke, atherosclerosis; arteriosclerosis, or 
embolism. For example, this gene product may represent a soluble factor produced by 
smooth muscle that regulates the innervation of organs or regulates the survival of 
neighboring neurons. Likewise, it is involved in controlling the digestive process, and 
such actions as peristalsis. Similarly, iris involved in controlling the vasculature in 
areas where smooth muscle surrounds the endothelium of blood vessels. Furthermore, 
the protein may also be used to determine biological activity, to raise antibodies,' as 
tissue markers, to isolate cognate HgancLs or receptors, to identify agents that modulate 
their interactions, in addition to its use as a nutritional supplement. Protein, as well as,, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. The secreted protein can also be 
used lo determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate iigands or receptors, to identify agents that modulate their interactions, and as 
nutritional supplements. It may also have a very wide range of biological activities. 
Representative uses arc described in the "Chemotaxis" and "Binding' Activity" 
sections below, in Examples 11, 12, 13,- 14, 15, 16, 18, 19, and20. and elsewhere 
herein. Briefly, the protein may possess the following activities: cytokine, cell 
proliferation/differentiation modulating activity or induction of other cytokines; 
irnrnunostimuiating/immunosuppressaril activities (e.g. for treating human 
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immunodeficiency virus infection /cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); 
hemostatic or thrombolytic activity (e.g. fur treating hemophilia, cardiac infarction 
etc.): anti-inflammatory activity (e.g. for treating septic shock, Crohn's Disease); as 
antimicrobials: for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. 

Many polynucleotide sequences, such as EST sequences, are publicly • 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 64 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequencers 
cumbersome, Accordingly, preferably excluded from the present invention arc one or . 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1347 of SEQ ID NO:64, b is an 
integer of 15 to 136 1 . where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:64. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 55 

This gene is expressed primarily in both normal and cancerous pancreas.. 

Therefore .polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 1 
not limited to, diabetes, gastrointestinal disorders, and pancreatic cancer. Similarly', 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the digestive and 
blood systems, expression of this gene at significantly higher or lower levels is 
routinely detected iri certain tissues or cell types (e.g., cancerous. and wounded 
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tissues) or bodily fluids (e.g.. scrum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. V 

,, . The tissue ^distribution in pancreas indicates that the protein" products of this ■ 
gene are useful as a therapeutic and/or diagnostic agent for pancreatic disorders and 
disorders of the endocrine and exocrine system, including but not limited to diabetes, 
blood disorders, pancreatic cancer, gastrointestinal diseases, hormomal imbalance, 
autoimmune disorders, cystic fibrosis, pancreatitis, and gallstones. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. • 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:65 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 1 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 933 of SEQ ID NO:65, b is an 
integer of 15 to 947, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:65, and where b is greater than or equal to a + 14. 

FEATURES GF PROTEIN ENCODED BY GENE NO: 56 

The translation product of this gene shares sequence homology with 
oxidoreductasc. Preferred polypeptides of the invention comprise the following amino 
acid sequence: MSPLSAARAALRVYAVGAAVILAOLLRRCRGGFLEPVXPPRP 
DRVAIVTGGTDGIGYSTANI^DIGMHVnAGNNDSKAKQVVSFaKEETXNP 
KVEliYCDLASMTSIRQFVQKJ^^ 

HFGLN YLG1 IFLLTNLLLDTLKESGSPGHS ARV VTVSSATHYVAELNMDDLQS 
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SACYSPHAAYAQSKLALVLFTYHLORLLAAEGSHVTANVVDPGVyNTO 
* HVFWATRLAKKLLGWLXFKTPDEGA^TSI YAA VTPELEGVGGRYLYNEKET 
KSLHVTYNQKLQQQLWSKSCEMTGVLDVTL (SEQ ID NO: 320). The mature 
form of this protein begins at residue 32. Thus, polypeptides comprising residues 2- 
330 and 32-330 of the sequence shown above are also provided. Polynucleotides 
encoding such polypeptides are also provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MSPLSAARAALRVYAVGAAVILAQLLRRCRGGFLEP 
VXPPRPDRVAIVTGGTDGIG YSTANIWRDLACMLS (SEQ ID NO: 321). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in breast cancer cells, osteoclastoma, wilm's 
tumor, thymus stromal cells, and T cell helper I. 

Therefore; polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer, e.g., breast cancer, osteoclastoma, and wilm's tumor. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing * 
immunological probes for differential identification of the tissue(s) or ceil rypc(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, kidney, immune, hematopoietic, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, breast milk, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to (he standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. ... 

The tissue distribution in breast cancer tissue, combined with the homology to 
oxidoreductase indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for diagnosis and treatment of cancer, particularly, breast cancer, 
osteoclastoma, and wilm's tumor. This protein may play a role in the regulation of ■ 
cellular division, and may show utility in the diagnosis I treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
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conditions. Representative iises are described in the "Hyperproliferative Disorders" 
and "Regeneration sections below and elsewhere herein. Briefly', developmental , 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern, 
formation. , / 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or, in failure to contrul the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore,. the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
different! aiion and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may, exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new*' 
insight into the regulation of cellular growth and proliferation; Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement, protein, as well as, . t 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. ' . 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:66 and may have been publicly available prior to conception of 
the present invention. Preferably; such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded'from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 362 of SEQ ID NO: 66,- b is an 
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integer of 15 to 1376. where both a and b correspond lo the positions of nucleotide 
residues shown in SEQ ID NO: 66, and where b is greater than or equal to a + 14. .> 

FEATURES OF PRO I KIN ENCODED BY GENE NO: 57 

This gene is expressed primarily in monocytes, T cell helper II and B ceil 
lymphoma. . •• ' 

Therefore, polynucleotides and 'polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample andrfor diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly B- 
cell lymphoma. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue (s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g.. immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 1 80 as residues: Asp-30 to Val-40. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in monocytes, T cell helper, and B cell lymphoma cells 
indicates that polynucleotides and polypeptides conesponding to this gene are useful 
for diagnosis and treatment of B cell lymphoma. Representative uses are described in 
the "Immune Activity" and "infectious disease" sections below, in Example 1 1, 13, 
14, 16, 18, 19, 20, and 27; and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
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other processes suggesting a usefulness in the treatment of cancer (e.g by boosting " 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunoiogicai disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthiiu^,. granulomatous Disease, inflammatory * 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted. organs arid ■ 
tissues, such as host- versus- graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, tense tissue injury, demyeli nation, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Diseasc'and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, arid 
in the differentiation and/or proliferation of various ccll types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence, databases. Some of these sequences are* 
related to SEQ ID NO: 67 and may have been publicly available prior to conception of ' 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present; invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are onepr 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I. to 2420 of SEQ ID NO:67, b is an 
integer of 15 to 2434, where both a and b correspond to the positions, of nucleotide 
residues shown in SEQ ID NO: 67, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 58 

This gene is expressed primarily in human lung cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
bio^ogicai sample and for diagnosis of diseases and conditions which include, but are 
not limited to, pulmonary diseases and/or disorders, particularly cancers of the lung. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in. 
providing immunological probes for differential identification of the tissuc(s) or cell 
type(s). For a number of disorders of the above tissues, or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely detected, in certain tissues or cell types (e.g!, pulmonary, and cancerous and ; 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, pulmonary lavage, 
pulmonary surfactant, synovial fluid and spinal fluid) or another tissue 6r cell sample 
taken from an individual having such a disorder, relative to the standard gene *. 
expression level, i.e„ the expression level in healthy tissue or bodily fluid from an 1 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 18 1 as residues: Phe-39 to Asp-45. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in lung cancer tissue indicates that polynucleotides and 
polypeptides corresponding to this gene arc useful for diagnosis and treatment of . 
immune system disorders such as ARDS, cystic fibrosis, and cancer, particularly lung 
cancer. This protein, may play a role in the regulation of cellular division, and may 
show utility in the diagnosis, treatment, and/or prevention of developmentaldiseases 
and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "HyperproUferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptbsis in pattern formation. 

v Dysregulation of apoptosis can result in inappropriate suppression of cell • 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
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potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention arc useful in treating, 
detecting, ant/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:68 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every' related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 072 of SEQ ID NO:68, b is an 
integer of 15 to 1086, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:68, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

This gene is expressed primarily in larynx carcinoma and early stage human 

lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell typc(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are * 
not limited to, developmental, gastrointestinal, and pulmonary diseases and/or 
disorders, particularly larynx carcinoma. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s)or ceil type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., developmental, gastrointestinal, pulmonary, and cancerous and wounded 
tissues) or bodily fluids (e.g.. serum, plasma, amniotic fluid, pulmonary lavage, 
sputum, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken' 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not * 
having the disorder: 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 182 as residues: His-42 to Lys-49. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in larynx carcinoma and early stage human lung 
indicates that polynucleotides and polypeptides corresponding to this gene are useful 
for tread ng immune system disorders such as cancer, particularly larynx carcinoma. 
This protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 
described in the "Hyperproliferative Disorders" and "Regeneration" sections below 
and elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregiilation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of eel i death, as is believed to occur in acquired immunodeficiency and certain 
■ neurodegenerative disorders, such as spinal muscular atrophy (SM A). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
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poiynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerati ve conditions: Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of ' 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the ~ 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their : 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, " " 
antibodies directed against the protein may show utility as a tumor marker and/or f ; 
immunotherapy targets for the above listed tissues; * 

Many polynucleotide sequences, such as EST sequences, are publicly . 
available and accessible through sequence databases. Some of these sequences are ' 
related to SEQ ID NO:69 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically < ' " 
excluded from the scope of the present invention. To list every related, sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between i to 1248 of SEQ ID NO:69, b is an 
integer of 15 to 1262, where both a and b correspond to the positions of nucleotide , 
residues shown in SEQ ID NO:69, and where h is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: ^VTTEDTSRTDVSEPATSGGAAIXJVTSIAPTAVASSTTAASrTTA 
ASSMTVASSAPTTAASSTTyASIAITT^ 

TPSTTATGHPSLSTALAQVPKSSALPRTATLATLA1RAQTVATTANTSSPMST 
RPSPSIOiMPSDTAASPVPPMXPQAQGPISQVSVDQPVVNTTXKSTXMPSNTr 
XEPLTQAWDKTLLLVVLLLGVTIJTr^ 
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NGMYADSEM {SEQ ID NO: 322). Polynucleotides encoding these polypeptides arc 
also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: ARCPELPGLRCRPRPRAGPQAPSYCPRATRPPG 
ACCARMRLLLEWRVYLRLTC^ 

PTALQGGRWSRLGASP (SEQ ID NO: 323). Polynucleotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in adipocytes, osteoblasts, cerebellum, 
hypothalamus and Hodgkin's lymphoma. ' 

Therefore, polynucleotides and polypeptides of the invention arc useful as , 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic, skeletal, neural; and immune diseases and/or disorders, 
particularly Hodgkin's lymphoma. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue (s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely delected in certain tissues or cell types 
(e.g., metabolic, skeletal, neural, immune, and cancerous and wounded tissues) or * ■ 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder; relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 183 as residues: Pro-33;to Gln-40, Gly-51 to Arg-56. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in Hodgkin's lymphoma cells indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of immune system disorders such as cancer, particularly Hodgkin's 
lymphoma. The secreted protein can also be used to determine biological activity, to ' ' 
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raise antibodies, as tissue markers, to isolate cognate ligands or receptors, , to identify 
agents that modulate their interactions, and as nutritional supplements. It may also 
have a very wide range of biological activities. Representative uses are described in 
the "Chemotaxis" and "Binding Activity" sections below, in Examples. 1 1 , 12, 13, 14, 
15, 16,. -18, 19, and 20, and elsewhere herein. Briefly, the protein may possess the 
following activities: cytokine, cell proliferation/differentiation modulating activity or 
induction of other cytokines; immunostimulating/immuriosuppressant activities (e.g. 
for treating human immunodeficiency virus infection, cancer, autoimmune diseases 
and allergy); regulation of hematopoiesis (e.g. for treating anemia or as adjunct to 
chemotherapy); stimulation or growth of bone, cartilage, tendons, ligaments and/or 
nerves (e,g. for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g. for treating 
infections, rumors); hemostatic or thrombolytic activity (e.g. for treating hemophilia, 
cardiac infarction etc.); anti-inflammatory activity (e.g. for treating septic shock, 
Grohn's Disease); as antimicrobials; for treating psoriasis of other hyperprdliferative 
diseases; for regulation of metabolism, and behavior. Also contemplated is the use of 
the corresponding nucleic acid in gene therapy procedures. Protein, as well as, 4 
antibodies directed against the protein may show utility as a tumor marker and/of 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences,;such as, EST sequences, are publicly ' 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:70 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising* a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1628 of SEQ ID NO:70, b is an 
integer of 15 to 1642, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 70, and where b is greater than or equal to a + 14. 



WO 00/06698 PCT/USW/17130 

'l54 ' - 

■ ' ' ' * 

FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

The translation product of this gene shares sequence homology with 
polypeptide in the cy statin family. Cystatin polypeptides are cysteine protease 
s inhibitors^ For an analysis of the composition of several members.of the cystatin 
family.see Gene (1987) 61(3):329-338, incorporated herein by reference. The cystatin ' 
activity of poly peptides ejicoded by this gene is measured by several assays known in 
the an including assays described in coowned, copending US Patent Application.;. 
Serial No. 08/744, 1 38, incorporated herein by reference. Preferred polypeptides of the 
invention comprise the following amino acid sequence: LPATVEFAVHTFNQQSKD 
YYAYRLGHILNSWKEQVESKTXTSMEIXLGRTRCGKFEDDroNCHFQESTEL 
NNTFTCFFTISTRPWMTQFSLLNKTC (SEQ ID NO: 324). Fragments of such ' " 
polypeptides having cystatin activity (cysteine protease inhibitory activity are 
particularly preferred). Polynucleotides encoding such polypeptides are also provided. 

., In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: LLWARGLGRAKSAVPTVST MLGLPWKGGLS 
WALLLLLLGSQILLIYAWHFHEQRrX^Er 
KDYYAYRLGHILNSWKEQVESk^ 

STELNNTFTCFFTISTRPWMTQFSLLNK TCLEGFH (SEQ ID NO: 325). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in testes and epididiymus. For a review of a 
cystatin showing testes- specific expression see Mol. Endocrinol. (1992 Oct.) 
6( 10): 1653- 1664, incorporated herein by reference. 

Therefore, polynucleotides and polypeptides of the invention are useful as, 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, They should therefore serve a protective function to regulate die 
activities of such endogenous proteinases, which otherwise may cause uncontrolled 
proteolysis and tissue damage. Cysteine proteinase activity can normally not be 
measured in body fluids, but can been detected extracellulariy in conditions like 
endotoxin-induced sepsis, metastasizing cancer, and at local inflammatory processes 
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in rheumatoid arthritis , purulent bronchiectasis and.periodomitis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune^ 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, testicular* and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, seminal fluid, synovial fluid and . 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 184 as residues; Phe-31 to Asp-38, Asn-59 to Tyr-65, 
Ser-76 to GIu-82, Thr-96 to Cys-108, Gin- 11 1 to Asn-1 18. Polynucleotides encoding 
said polypeptides are also provided. 

The tissue distribution in testes and epididiymus. combined with the 
homology to cystatihs indicates that polynucleotides and polypeptides corresponding 
to this gene are useful for the treatment and diagnosis of conditions concerning proper 
testicular function (e.g. endocrine function, sperm maturation), as well as cancer. 
Therefore, this gene product is useful in the treatment of male infertility and/or 
impotence. This gene product is also useful in assays designed to identify binding 
agents, as such agents (antagonists) are useful as male contraceptive agents! Similarly, 
the protein is believed to be useful in the treatment and/or diagnosis of testicular 
cancer. The testes are also a site of active gene expression of, transcripts that is 
expressed, particularly at low levels, in other tissues of the body. Therefore, this gene 
product is expressed in other specific tissues or organs where it may play related 
functional roles in other processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. 
Representative uses are described in the "Hyperproliferative Disorders" and 
"Regeneration" sections below and elsewhere herein. Briefly, developmental tissues 
rely oh decisions involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of ccjL 
death, as occurs in the development of some cancers, or in failure to control the extent 



WO 00/06698 PCT/US99/17130 

; ' . ' 156 

of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SM A). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification . Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions,, in addition to other types , 
, of degenerative conditions. Thus this protein may modulate npoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Cysteine proteinase inhibitors of the cystatin superfamily^are ubiquitous in the 
body and are generally tight- binding inhibitors of papain-like cysteine proteinases, 
such as cathepsins B. H, L, S, and K. They should therefore serve a protective 
function to regulate the activities of such endogenous proteinases,- which otherwise 
may cause uncontrolled proteolysis and tissue damage. Cysteine proteinase activity 
can normally not be measured in body fluids, but can been detected extracellularly in 
conditions like endotoxin-induced sepsis, metastasizing cancer, and at local ( 
inflammatory processes in rheumatoid arthritis, purulent bronchiectasis and 
periodontitis, which indicates that a tight cystatin regulation is a necessity in the 
normal state. A deficiency state in which the levels of the intracellular cystatin, 
cystatin B, are lowered due to mutations has recently been shown to segregate with a 
forrn of progressive myoclonus epilepsy, which points to additional specialized 
functions of cystatins. Moreover, results showing that chicken cystatin inhibits polio ' 
virus replication, human cystatin C inhibits corona- and herpes simplex virus 
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replication, and human cy statin A inhibits rhabdovirus-induced apoptosis in ceil 

cultures indicates that cystatins play additional roles in the human defense system. 

The cystatins constitute a superfamily of evolutionarily related proteins, all composed 

of at least one 100- 1 20 residue domain with conserved sequence motifs. 

The previously well characterized single-domain human members of this 

superfamily could be grouped in two protein families. The Family, 1 members, 

cystatins (or siefms) A and B, contain approximately 100 amino acid residues lack 

disulfide bridges, arid arc not synthesized as preproieins with signal peptides. The 

Family 2, cystatins (cystatins C, D, S, SN, and SA) are secreted proteins of approx. 

120 amino acid residues (Mr 13,000-14,000) and have two characteristic intrachain ' ' 

disulfide bonds. Recently, we identified an additional human cystatin superfamily 

member by EST 1 sequencing in epithelial cell derived cDNA libraries which we 

named cystatin E. The same cystatin was independendy discovered by differential 

display experiments as a mRNA species down-regulated in breast tumor tissue, but 

present in the surrounding epithelium and reported under the name cystatin M . 

Cystatin E/M is an atypical, secreted low-Mr cystatin in that it is a glycoprotein and 

just shows 30-35% sequence identity in alignments with the human Family 2 

cystatins, which shows that additional cystatin families are yet to be identified. The ' 

cystatin E/M gene has been localized to chromosome 2 , whereas all human Family 2 

cystatin genes are clustered on the short arm of chromosome 20, which further 

stresses that cystatin E/M is just distantly related to the other secreted human low-Mr 

cystatins. It is believed therefore; that polypeptides encoded by this gene are useful in 

diagnosing and treating disease consistent with the aforementioned conditions in 

which cystatins are implicated. ' 

Many polynucleotide sequences, such as EST sequences, arc publicly 

available and accessible through sequence databases. Some of these sequences are • 

related to SEQ ID NO:7I and may have been publicly available prior ^conception of 

the present invention. Preferably, such related polynucleotides are specifically * 

excluded from the scope of the present invention. To list every related sequence is 

cumbersome: Accordingly, preferably excluded from the present invention arc one or 

more polynucleotides comprising a nucleotide sequence described by the general 

formula of a-b, where a is any integer between 1 to 907 of SEQ ID NO:7 1, b is an I, 
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residues shown in SEQ ID NO: 7 1, 'and where b is greater ( than orequal to a+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

The translation;product of this gchc shares sequence homology with 
Neutrophil Gelatinase-Associated Lipocalin which is thought to be important in 
immune regulation (See Genbank and Geneseq Accession Nos. embfCAA58127.1, ' 
and US5627034. respectively ; all references and information available through these 
accessions are hereby incorporated herein by reference ; for example, Biochem. r 
Biophys. Res. Commun. 202 (3), 1468-1475 (1994), i and FEBS Lett. 314 (3), 386.388 
(1992)). • ' * . ( ' ' 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide, are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: LEQKLELHRGGGRSRTSGSPGLQEFGTREERGE 
GEQRTGREFSGNGGRAVEAARMRU.CGL\VXWLSLLKVLQAQTPTPLPLPP 
PMQSFQGNQFQGEWFv-LGLAGNSFRPEHRALLNAFTATFELSDDGRFEVWN 
AMTRGQHCDTWSYVLIPAAQPGQFrydH > 
. GUSDDNIVFPDVTGXAI.OL XSLPWVAAPA (SEQ ID NO: 326). Polynucleotides 
encoding these polypeptides arc also provided. * 

This gene is expressed primarily in epididiymus and osteoclastoma. " ■ . 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive and skeletal diseases and/or disorders, particularly cancers 
such as osteoclastoma testicular cancer. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful iri providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly, of the immune system, expression of this gene 
at significantly higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., reproductive, testicular, skeletal, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, seminal fluid, urine, synovial fluid and spinal fluid) 
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or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e.; the expression level in healthy* , 
- tissue or bodily fluid from an individual not having the disorder. , ^ • 
Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: , 1 85 as residues: Met-82 to Thr-90. Polynucleotides 
encoding said polypeptides are also 1 provided. 

1 The tissue distribution in epididiymus and homology to neutrophil gelatinase- 

associated lipocalin indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for diagnosis and treatment of skin diseases and immune system 
disorders such as cancer, particularly osteoclastoma. The secreted protein can also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, and as 
i nutritional supplements. It may also have a very wide range of biological activities. 
Representative uses are described in the "Chemotaxis" and "Binding Activity" 
sections below, in Examples 11. 12, 13. 14, 15, 16; 18, 19, and 20, and elsewhere 
herein; Briefly, the protein may possess the following activities: cytokine, cell * 
prolifcration/difTerentiation modulating activity or induction of other cytokines; 
immunpstuTiulaung/immunosuppressant activities {e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
■ stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating Hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); . 
hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction 
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's Disease); as 
antimicrobials; for treating psoriasis or other hypefjproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as. antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues.'' 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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. related to SEQ ID NO: 72 and may have been publicly available prior to conception of . 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 892 of SEQ ID NO:72, b is an 
integer of 15 to 906, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO;72, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

The translation product of this gene was shown to have homology to colipase 
which plays an essential role in the intestinal fat digestion by anchoring lipase on 
lipid/water interfaces in the presence of bile salts (See Genbank Accession No. 
gb(AAA035 1 3:1; all references and information available through this accession are 
hereby incorporated by reference herein). 

This gene is expressed primarily in epididiymus. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive diseases and/or disorders, particularly eptdidiyrhus -related 
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., reproductive, 
metabolic, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
seminal fluids bile, chyme, urine, synovial fluid and spinal fluid) or another tissue or. 
cell sample taken from an individual having such a disorder; relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. * t 

; Preferred polypeptides of the present invention comprise immunogenic, 
epitopes shown in SEQ ID NO: 186 as residues: IIe-40 to Gys-49, Arg-52 to Cys-57, 
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Ser-94 io Trp-99, Gly-105 to Gly-1 1 1. Polynucleotides encoding said polypeptides 
are also provided. 

The tissue distribution in epididiymus indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immune system diseases and disorders of the epididiymus. Polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
' contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene 
expression of transcripts that is expressed, particularly at low levels, in other tissues 
of the body. Therefore, this gene product is expressed in other specific tissues or 
organs where it may play related functional roles in other processes, such as 
hematopoiesis, inflammarion, bone formation, and kidney function, to name a few 
possible target indications, furthermore, the protein may aJsb be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/of immunotherapy targets tor the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences arc 
related toSEQ ID NU:73 and may have been publicly available prior, to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is> 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 666 of SEQ ID NO:73, b is an 
integer of 15 to 680, where both a and b correspond io the positions of nucleotide 
residues shown in SEQ ID NO:73; and where b is greater than or equal, to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: MCVCERKRGREKEGGVTPTMTSNFPFCTLILGI 
AQAQACPGCPGDWPGLGSGVGEGLHHIRfCRTPIPCSPPAPAAACLGSGH 
ARLPCyLRLVVTVP^NL^SPFRI^ALHC^ 

LAACLLTFQKTPLELPMAWHLLVATPCYQMLDNLPLPSAAAN WC (SEQ1D 
NO: 327). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in melanocytes and placenta and to a lesser, 
extent in bone marrow and many cells of the immune system, including B-cells, 
dendritic ceils, arid T-cells. • • 1 

Therefore, polynucleotides and .polypeptides of the' invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, skin cancer and disorders of the reproductive and immune systems. 
Similarly , polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissuc(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of, the 
reproductive and immune systems, expression of this gene at significantly higher or 
lower levels is routinely detected'in certain tissues and cell types (e.g., reproductive 
tissue, hematopoietic tissue, melanocytes and ceils and tissue of the immune system, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, amniotic 
fluid, urine, synovial fluid and spinal lluid) or another tissue or cell sample taken 
t from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. - , • . - ' ■ 

The tissue distribution in melanocytes indicates that polynucleotides and 
polypeptides corresponding to this gene arc useful for the diagnosis and treatment of 
disorders affecting the skin, the reproductive system, and the immune system, 
particularly cancers. Representative uses are described in the "Biological Activity", 
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"Hypcrproliferative Disorders .."infectious disease^ and "Regeneration" sections , 
below, in Example 11, 19, and 20. and elsewhere herein. Briefly, the protein is useful 
10 m detecting, treating, and/or preventing congenital disorders (i!e. nevi; moles, 

freckles, Mongolian spots, hemangiomas: portrwine syndrome ). integumentary 
. : 5 tumors (i.e. keratoses. B owen's Disease, basal cell carcinoma, squamous cell 

carcinoma, malignant melanoma, Paget's Disease,. mycosis fungoides, and Kaposi's 
sarcoma), injuries and inflammation of the skin (i.e. wounds, rashes, prickly heat 
disorder, psoriasis, dermatitis}* atherosclerosis, uticaria, eczenuu photosensitivity, 
autoimmune disorders (i.e. lupiis erythematosus, viuligo,fdermatornyositis, morphea, 
2Q 1 10 scleroderma, pemphigoids and pemphigus), keloids; striae, erythema, petechias 

purpura, and xanthelasma. In addition, such disorders may predispose increased ' * 
susceptibility to viral and bacterial infections of the skin (i.e. cold sores, warts, 
chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, erysipelas, 
25 " impetigo, tinea, althJctes foot, and ringworm). Moreover, the protein product of -this 

15 gene may also be useful for the treatment or diagnosis of various connective tissue 
; disorders (i.e.. arthritis, trauma, tendonitis, chrondomaiacia and inflammation, etc.), 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, 
dcrmatomyositis, etc.), dwarfism, spinal deformation, joint abnormalities, amd ' 
■ chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 

20 osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). ' 
35 Funhermore, the protein may also be used to determine biological activity, to raise 

, r antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
* that modulate their interactions; in addition to its use as a nutritional supplement. 

« Protein, as well as, antibodies directed against the protein may show utdity as a tumor. 
40 25 marker and/or immunotherapy targets for the above, listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
t available and accessible through sequence databases. Some of these sequences are 

45 ./ related to SEQ rD NO:74 and may have been publicly available prior to conception of 

1 lne P resen t invention. Preferably, such related polynucleotides are spec ifically 

30 excluded from the scope of the' present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present inventi 
50 more polynucleotides comprising a nucleotide sequence/described by ( the general ' " " 
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formula of a-b, where a is any integer between 1 to 1619 of SEQ ID NO:74, b is an 
integer of 15 to 1633, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 74, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: YLWGRPRLI^RAGTSPSAPWGEKREKLGHKLPVALQGYHPWIL 
LECTVFWARVVLACFSLYLIRGPNCINRQPEPTYQKACNLliCSSDFGQER 
AP AWELLGPES EQRLRE YTAQG LQSL ASS HR WRQFKTEG KMRGGASPLPWLI 
CFW LCSYKGSDNSLKPVVPGPTLCPQSLVSPSVHPStRSASLGRHRAEAA 
(SEQ IE) NO: 328). Polynucleotides encoding these polypeptides are also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: MPGILAG1PVKDLCLSLLQGFRLLLLCVCPGVVL 
SGWMGGQKGSPRIVDIG (SEQ ID NO: 329). Polynucleotides encoding these 
polypeptides arc also provided. This gene maps to chromosome 15, accordingly, 
polynucleotides of the invention is used in linkage analysis as a marker tor • 
chromosome 15. 

This gene is expressed primarily in brain and breast and to a lesser extent in 
the liver, pancreas, and T-cells, 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
riot limited to, disorders affecting the brain and CNS, the reproductive system, or the 
immune system. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell typc(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous system, the reproductive system, and the immune 
system, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., brain and other tissue of the nervous 
system, mammary tissue, eendocrine tissue, hepatic tissue, reproductive tissue, cells 
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and tissue of the immune system, cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or ceil, 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
5 individual not having the disorder. 
V Preferred polypeptides of the present invention comprise immunogenic 

r5 epitopes shown in SEQ ID NO:. 188 as residues: Met-37 to Ser-43. Polynucleotides * 

encoding said polypeptides are also provided. V 

The tissue distribution in brain cells indicates that polynucleotides and 
10 polypeptides corresponding to.this gene arc useful for the diagnosis and treatment of 
disorders affecting the central nervous system, the reproductive system, and the ■ 
■ t immune system, including cancers. Representative uses are described in the 

"Regeneration" and "Hyperproliferative Disorders" sections below, in Example 1 1, 
25 . 15 • and 18, and elsewhere herein. Briefly, the uses include, but are not limited to the 

15 detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Toiirette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder; depression, panic disorder, 
20 learning disabilities, ALS, psychoses, autism; and altered behaviors, -including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in ' 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
25 neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional - 
supplement Protein, as well as. antibodies directed against the protein may show 
30 utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 
50 available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:75 and may have been publicly available prior ip conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b; where a is any integer between I to 1008 of SEQ ID NO:75, b is an 
integer of 15 to 1022. where both a and h correspond to the positions of nucleotide 
residues shown in SEQ ID NO:75, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

. In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically; polypeptides of the invention comprise the 
following.amino acid sequence: AKGEERKEAFSLKMVQLSSEPISFGLMYLYLGV 
FFHLIYPGALSrTTl.GKHSHPFFTAEQ 

AFSE (SEQ ID NO: 330). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in the brain and the immune system, in 
particular T-cells. , , 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders affecting the brain, such as Alzheimer's or disorders affecting 
the immune system, such as AIDS. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell rype(s). For a number of disorders of the above 
tissues or cells, particularly of the brain and CNS and the immune systems, expression 
of this gene at significantly higher or lower levels is routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, cells and 
tissue of the immune system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken frorn an individual having such a disorder, relative to the standard gene > 
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expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in brain cells and tissues indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the diagnosis and treatment 
of disorders affecting the brain and CNS or disorders affecting the immune system, 
Representative uses are described in the "Regeneration" and "Hyperpfoiiferative 
Disorders" sections below, in Example 1 1. 15, and 18, and elsewhere herein. Briefly^ 
the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia., mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 

■ in regions of the brain indicates it plays a role in normal neural function/ .. 

Potentially, this gene product is involved in synapse formation, 
neurotransrmssion, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 76 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. t Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1 170, of SEQ ID NO:76, b is an 



WO 00/06*98 



168 



PCT/US99/m30 



integer of 15 to 1 184. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:76. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 

The translation product of this gene shares sequence homology with 
pcnaeidin-2 which is thought to be a members of a new family of antimicrobial 
peptides from the hemolymph of shrimps Penaeus vannamei. The molecules display 
antimicrobial activity against fungi and bacteria with a predominant activity against 
Gram- positive bacteria. /. t . , 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GPAHPASPPLMTLS LQL AELVHFVCAFQSQWT 
GVYPMMPPLKPTEPLCFA CVPCR V (SEQ ID NO: 33 1 ). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly 
disorders affecting the spjeen, including bacterial and fungal infections. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the hematopoetic 
and immune systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues and cell types (e.g.. immune, hematopoietic, and : 
cells and tissue of the immune system, cancerous and wounded tissues) or bodily 
. fluids (e.g.. serum, plasma, urine, synovial fluid 'and spinal fluid) or another tissue or 
cell sample taken from an individual Having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 
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The tissue distribution in spleen and homology to the penaeidin family of ' 
antibiotics indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and treatment of disorders affecting the spleen, especially 
fungal and bacterial infections. Representative uses are described in the "Immune 
Activity * and "infectious disease" sections below, in Example 11, .13, 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product ' 
indicates a role in regulating trie proliferation; survival; differentiation; and/or * " 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation.' or 
other processes suggesting a usefulness in the treatment of caricer (e.g. by boosting 
immune responses). ,v 

* Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-ceil mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- versus-graft and graft- versus- host diseases, or autoimmunity ' 
disorders/such as autoimmune infertility, lense tissue injury, demyelinatioh, systemic 
lupus erytherrtatosis.'drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent u secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in * 
'■ the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors; to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are ' . 
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related to SEQ ID NO:77 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded frnm the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the genera! 
formula of a-b, where a is any integer between 1 to 298 of SEQ ID NO:77, b is an 
integer of 1 5 to 3 12, where both a and b correspond to the positions of nucleotide- 
residues shown in SEQ (D NO:77, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of TIIP-1 cells to 
calcium. Thus it is likely that the product of this gene is, involved in a signal 
transduction pathway, that is initiated when theproduct binds a receptor on the surface 
of the plasma membrane of both monocytes, in addition to other cell-lines or tissue 
cell types. Thus, polynucleotides and polypeptides have uses which include, but arc 1 
not limited to, activating immune and hematopoietic cells and tissue cell types. 
Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules v such as calcium, potassium and sodium, as well as alter pH and membrane 
potential. Alterations in small molecule concentration can be measured to identify 
supernatants which bind to receptors of a particular cell. 

Moreover, when tested in TF-I cell lines, the protein product of this gene has 
been shown to alter the steady-state messenger RNA levels of the following genes: c-', 
fos, c-jun. egr-U b561, bcl-2, CD40, cyclin D2, CADPH, ICER, MAD3, p21, STAT3, 
ID3, and STAT-1. When tested inlI937 cell lines, the protein product of this gene has 
been shown to alter the steady-state messenger RNA levels of the following genes: 
egr2, IvlKPl, ATF3, B562. cyclin D, cyclin D2, GAT A3, MAD3, p21, TGF, DHFR, 
and JAK3. Based upon these results, it is anticipated that polynucleotides and 
polypeptides corresponding to this gene are useful as agonists or antagonists of the 
above referenced genes. Such activity is useful in therapeutic and/or diagnostic 
applications as referenced and more specifically discussed elsewere herein. 
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In specific embodiments, polpeptides of the invention comprise the 
sequence:MLLEVYGDSISVTVAIPL (SEQ ID NO:332), 

MHSPCQSKAADGLGKSETE (SEQ ID NO: 333), and/or MLKSLGLSTN (SEQ ID 
NO: 3314). Polynucleotides encoding these polypeptides are also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
/the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: AQRLAEECFYMLLEVYGDSISVTVAIPLMHSP 
CQSKAADGLGKSETEMLKSLGLSTNMSPFHLLGLKVFLTWALTLAQICLY 
FFEVQPLGLL ALNFFCTATAGLKELCM HPPSLAFTPEFHTS LS PL AJPS FCGTS 
VSLSNSHTIPLSLYLPFPSKS^lPDTLHLLVHSLPLVHSQyLPVKDVTreWPLC 
QRCLGSTCH Q (SEQ ID NO: 335), Polynucieotides encoding these polypeptides 
are also provided. 

The polypeptide of this gene has been determined to have a' transmembrane 
domain at about amino acid position 1 1 - 27 of the amino acid sequence referenced in 
Table 1 for this gene. Moreover, a cytoplasmic tail encompassing amino acids 28 to 
143 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type la 
membrane proteins. 

This gene is expressed primarily in neutrophils and Trcells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are . 
not limited to, disorders and/or diseases affecting the immune system. Similarly," , 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the the immune 
system, expression of this gene at significantly higher of lower levels is routinely 
detected in certain tissues and cell types (e.g., immune; hematopoietic, cells and tissue 
1 of the immune system, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. * 

Preferred polypeptides of the present invention comprise immunogenic, 
epitopes shown in SEQ ID NO: 191 as residues: Pro-97 to Asp- 104. Polynucleotides 
- encoding said polypeptides are also provided. 

' , The tissue distribution in neutrophils and T-cells. combined with the detected 
calcium flux biological activity indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of disorders 
affecting the immure system. Representative uses are described in the "Immune 
Activity" and "infectious disease" sections below, in Example 11, 13, 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product 
indicates a role in regulating the proliferation; survival: differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or ' 
other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is Involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders' including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou s Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lensc tissue injury, demyelination, systemic 
lupus crythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior.of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
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interactions, in addition to' its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences; such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO;78 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome/ Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
-formula of a-b, where a is any integer between 1 to 1356 of SEQ ID NO:78, b is an- 
integer of 15 to 1370, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:78, and where b is greater than or equal to a + 14. f 

FEATURES OK PROTEIN ENCODED BY GENE NO: 69 

In another embodiment; polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: WIPR A AGIRHEVQySLFQMFCFSSIFCSH 
EHTHLPGTFWLFLFLFLILPPSCPCFLPFSLAIETVRWPCWHHPTSFELCY 
PGTSIYYASRGGPXPNSEX (SEQ TD NO: 336). Polynucleotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the ussue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
■not limited to, diseases and/or disorders affecting the immune system, and neutrophils 
in particular. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues and cell types (e.g., blood cells, 
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and cells and tissue of the immune system, and cancerous and wounded tissues) or 
^ bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another ' 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 1 
fluid from an individual not having the disorder. 

The tissue distribution in neutrophils indicates that poly nucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
disorders affecting the immune system and neutrophils in particular. Representative 
uses arc described in the "Immune Activity" and "infectious disease" sections below, , 
in Example 11, 13, 14, 16, 18, 19, 20. and 27, and elsewhere herein. Briefly, the 
4 expression of this gene product indicates a role in regulating the proliferation; 
survival: differentiation; and/or activation of hematopoietic cell iineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

.Since the gene is expressed in cells of lymphoid' origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
•such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic, 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood ceJIs, or that recruits 
hematopoietic cells to sites, of injury. Thus, this gene product is thought to be useful in 
the expansion of stem ceils and committed progenitors of various biood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 



WO 00/06698 PCT7LS99/17130 

175- , 

antibodies directed against the protein may show utility as a tumor marker and/or 
, immunotherapy targets for the above listed tissues. . 

' Many polynucleotide sequences, such as EST sequences, are publicly . 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 79 arid may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically * 
excluded from the scope of the present invention. To list every, related sequence is 
. cumbersome: Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between. 1 to 3 54 of SEQ ID NO: 79, b is an , . 
integer of 1 5 to 368, where both a and b correspond to the^positions of nucleotide f 
residues shown in SEQ ID NO:79, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 ' 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following anuno acid sequence: XNXKSPLTIGNKSWSSTAVAAALELVDPPGCR. 
NSARDSPELVHLGKGRPRKLMTYLFCSSISLLLLKVHSSGHQDIRKAKSKVP 
. RLLUQCPQQRE (SEQ ID NO: 337). Polynucleotides encoding these polypeptides 
are also provided. •• 
This gene is expressed primarily in smooth muscle. 
Therefore, polynucleotides arid polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but aire 
not limited to, disorders affecting smooth muscle tissue, particularly vascular 
conditions. Similarly, polypeptides and antibodies.directed to these polypeptides are 
useful in providing iinmunological probes for differential identification of the 
tissue(s) or cell type(s): h'or a number of disorders of the above tissues or cells, 
particularly of smooth muscle tissue expression of this gene at significantly higher or 
lower levels is routinely delected in certain tissues or cell types (e.g., muscle, and. 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
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fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression .< 
level in healthy tissue or bodily fluid from -an individual hot having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 193 as residues: Ser-18 to Val-31. Polynucleotides 
. encoding said polypeptides are also provided. 

The tissue distribution primarily in smooth muscle indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of disorders affecting smooth muscle tissue. Moreover, the , 
protein is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis^ and/or atherosclerosis. Furthermore, the protein may also be 
used to determine biologicalactivity, to raise antibodies, as tissue markers, "to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
. addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as.EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 80 and may have been publicly available prior to conception of 
the present invention., Preferably, such related polynucleotides are specifically 1 
excluded from the scope of the present invention. To list every related. sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 1074 of SEQ ID NO:80, b is an 
integer of 15 to 1088, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO; 80, and where h is greater. than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

Iri another embodiment, polypeptides comprising the amino acid sequence of s 
the open reading frame upsLream of the predicted signal peptide are contemplated by 
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the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GPEENLSPSTPSQMPTI^'VKLCLLQVCHGLFP 
LLKHWSQPMPLCVTLAPVS YWL (SEQ ID NO: 3 3 8) . Polynucleotides encoding * 
these polypeptides are also provided. 

This gene, is expressed primarily in fetal heart, smooth muscle, and frontal 

cortex. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, muscular, vascular, or neural diseases and/or disorders, particularly 
defects or injury tn cardiac muscle. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the cardiovascular system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., muscular, vascular, neural, developmental, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, amniotic fluid, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in fetal heart indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosing and treating defects 
to the heart either due to injury or congenital defects. Moreover, the protein is useful 
in the detection, treatment, and/or prevention of a variety of vascular disorders and 
conditions, which include, but are not limited to microvascular disease, vascular leak 
syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery disease, 
arteriosclerosis, and/or atherosclerosis. Alternatively, polynucleotides and 
polypeptides corresponding to this gene are useful ;for the detection, treatment,, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
"Hyperproliferative Disorders" sections below, in Example 11,15, and 18. and 
elsewhere herein. Briefly, the uses include, but are hot limited to the detection. 
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treatment; and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral .neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 1 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition; elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are - 
related tu SEQ ID NO: 81 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
. cumbersome. Accordingly, preferably excluded from the present invention axe one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 1848 of SEQ ID NO:8 1 , b is an 
integer of 15 to 1862, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:81; and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

The translation product of this gene shares sequence homology with adipose 
complement related protein which is thought to be important in regulating energy 
metabolism, insulin levels and fat stores. Moreover, Lhe protein product of this gene 
has also been shown to have homology to the complement subcomponent CIQ A- 
chain precursor and HP-25 protein (See Genbank and Geneseq Accession Nos. , , 
erablCAA41664. 1 , clbjlBAA02352.1, and W98013; all references and information 
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available through this accession are hereby incorporated hy reference herein). Based 
on trie sequence similarity, the translation product of this gene is expected to share at 
least some biological activities with complernem proteins. , 1 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: PRVRKEPEAMQWLRVRESPGEATGHRVTMG 
TAALGPVWAALLLr^LMCEIPMVELTFDRAVASDCQRCGDSEDPLbPAHySS 
. ASSSGRPHALPEIRPYMTILKGDKGpPGPMGLPGYMGREGPQGEPGPQGSK 
GDKGEMGSPGAPCQKRFFAFSVGRKTALHSGEDFOTLLraRVFVNLDGC 
FDMATGQFAAPLRniYFFSLNVHSWNYKETYVHIMHNQKEAVlLYAQPS 

. EDD (SEQ ID NO: 339). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in placenta and. fetal kidney, and umbilical 
vein and to a lesser extent in fetal heart, fetal liver/spleen, microvascular endothelial 
cells and cancers of the lung and pharynx. 

Therefore; polynucleotides and polypeptides of the invention are useful as 
reagents for differenrial identification of the tissue(s) or cell type(s) present in a 
' biological sample and for diagnosis of diseases and conditions which include, but are . 
not limited to, vascular, renal, and reproductive diseases and/or disorders, particularly 
cancers of the lung and pharynx. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells ; particularly of the pulmonary and immune systems, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., vascular, renal, reproductive, immune, hematpdietic, pulmonary* and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 195 as residues: Asp-36 to Asp-48, Ser-57 to His-62, 
Lys-77 to Gly-84. Met-V2 to Gly-1 14, Gln-203 to Ile-209, Lys-23 1 to Tyr-239. ' 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in pharynx or lung, combined with the homology to ' 
adipose complement related proteins indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosing and treating cancers of the 
pharynx or lung by modifying the metabolic balance in such tissues. Moreover, the 
protein is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. The gene product may also be 
involved in lymphopoiesis, therefore, it can be used in immune disorders such as 
infection, inflammation, allergy, immunodeficiency etc. in addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Furthermore, the protein may also be used to determine biological 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, 
to identify agents that nriodulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 82 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 604 of SEQ ID NO:82. b is an 
integer of 15 to 1618, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 82, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

The translation product of ihis gene shares sequence homology with a 
hypothetical 54.7 kD protein (F37A4. 1 ) from Caenorhabditis elegans (S wissProt 
locus YPTl^CAEEL, accession P41879). The protein product of this gene also has 
homology to the human NG26 which is thought to contain a human major 
histocompatibility complex class III and is involved in T-cell maturation (See 
Genbank Accession No. gblAAD18079.ll (AF129756); all references and information 
available through this accession are hereby incorporated by reference herein; for 
example, J. Neurochcmi 69 (6), 2516-2528 (1997)) Based oh the sequence similarity, 
the translation product of this gene is expected to share at least some biological 
activities with nitric oxide synthase proteins. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: ML YPGSVYLLQKALMPVLLQGQARLVEECNGRRAkLLACDGNE 
IDTMFVDRRGTAEPQGQKLVICGEGNAGFYEyGCVSTPLEAGYSVLGWNHP 
GFAGSTGVPFPQMEANAMDVVVQFAIHRLGFQPQDIliYAWSIGGFTATWAA 
MSYPDVSAMILDASFDDLVPLALKVMPDSWRGLVTRtVRQHLNLNNAEQLC 
RYQGPVLLIRRTKX^EinTTVPEDIMSNRGNDLLLKLLQHRYPRVMAEEGLRV 
VRQWLEASSQLEEASIYSRWEVEEDWCLSVLRSYQAEHGPDFPWSVGEDMS 
ADGRRQL A I .FL ARKHLHNFE ATHCTPLPAQNFQMPWHL (SEQ ID NO: 340), 
Polynucleotides encoding such polypeptides are also provided.' 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the - 
following amino acid sequence: VCPKWCRFLTMLGHCCYFWQVWPASEALAA 
GPTPSTGSSSPSWkQHIGTSLQKTRGSLPTTTLf SGAGQSTSTGKNPAAGR s 
SLEGALPAGVWPCF AQS PCTGGQQTP SSTGLRSCI VRSPATWWRTP (SEQ ID 
NO: 341). Polynucleotides encoding these polypeptides are also provided. 

Preferred polypeptides of the invention comprise the following amino acid 
' sequence: MPRAAGIRlIEri'REXDSERAPASVPETPfAVTAPHSSSWDTYYQ 
PRALEKHAOSILALASVFWSISYYSSPFAFFYLYRKGYLSLSKVVPFSHYAG 
TLLLLLAGVACXRGIGR\VTNPQYRQFrriLEATHIWQSSENKRQLANYNn) 
FRSWPVDFHWEEPSSRKESRGGPSRRGVALLRPEPLHRGTADTLLNRVKKL 
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PCQITSYLVAHTLGRRMLYPGSVYLLQKAL^PVLLQGQARLV'ffiCNGRRAK 
LLACDGNEIDTNIFVDRRGTAEPQGQKJLVICC^GNAGFYEVGCV 
VLGWNHPGFAGSTGVPFPQNEANAMDVVVQFAIHRLGFQPQDniYAWSI 
GGFTATWAAMSYPDVSAMILDASFDDLVPLALKVMPDSWRGLVTRTVRQ 
HLNLNNAEQLCRYQGPVLLIRRTKDEIITTTVPEDIMSNRGNDLLLKLLQFIRY 
PRVMAEEGLRVVRQWI^ASSQLEEASIYSRWEVEEDWCLSVLRSYQAEHGP 
DFPWSVGEDMSAJ3GRRQLALI^ARKHLHNFEATHCT PLPAQNFQMPWHL 
(SEQ ID NO: 342). Polynucleotides encoding these polypeptides are also provided; A 
preferred polypeptide variant of the invention comprises the following amino acid 
sequence: HERAXGPSRGHGELLSCVLGPRLYKIYRERDSERAPASVPETPTA 
VTAPHSSSWDTYYQP RALEKWADSILALASyp^SlSYYSSPF.AiTYLYRKGY 
LS LS K V V PFSH Y A GTLLLLL AG V ACSEALAAGPTPSTGSSSPSWKQHIGTSLQ 
KTRGSLPTTTLTSGAGQSTSTGKNPAAGRSLEGALPAGVWPCFAQSPCTGG 
QQTPSSTGL RSCLVRSPATWWRTP (SEQ ID NO: 343). Polynucleotides 
encoding these polypeptides are also provided. 

The gene encoding the disclosed cE>NA is believed to reside on chromosome 
6. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chrcrccsomc 6. * ' 

This gene is expressed primarily in cerebellum, pituitary, fetal liver, and 
primary dendritic ceils and to a lesser extent in in a wide range of tissues and . 
developmental stages (i.e. fetal and adult tissue,* etc.). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural, developmental and immune diseases and/or disorders, 
particularly those involving self recognition and T. anc B-cell maturation, and cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunoiugical probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
neural or hormonal system, expression of this gene at significantly higher or lower 
levels is routinely detected in certain tissues or cell types (e.g., neural, developmental, 
immune, hepatic, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
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plasma, amniotic fluid, urine, synovial fluid and spinaJ fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individuaJ not having the disorder. * * 1 \ 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 196 as residues: Thr-23 to Lys-34, Leu-41 to Ser-47, 
Ala-57 to Ala-68, Pro-89 to Gly- 101, Pro- 110 to Pro- 11 7. Polynucleotides encoding 
said polypeptides. are also provided. / 

The tissue distribution in developmental and immune cells, combined with the 
homology to the human major histocompatibility complex class III region, indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
treatment and diagnosis of cancer and other proliferative disorders. Representative 
uses are described in the "Immune Activity" and "infectious disease" sections below, ■' 
in Example 1 1, 13, 14, 16, 18, 19, 20. and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, ache, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; irnmune reactions to transplanted organs and . 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyeiiriation, systemic 
, lupus erythemaiosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's , 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that . 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gchc product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
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in the differentiation and/or proliferation of various cell types. Potentially, this gene 
product is involved in synapse formation, neurotransmission, learning, cognition, 
homeostasis, or neuronal differentiation or survival. Furthermore; trie protein may 
also be used to determine biological activity, raise antibodies, as tissue markers, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or ~ 
immunotherapy targets for the above listed tissues. -■ t . - ;. 

«. Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 83 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2020, of SEQ ID NO:83,- b is an 
integer of 15 to 2034, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:83, and where b is greater than or equal to a 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

The translation product of this gene shares sequence homology with the br-l 
protein from the snail nervous system (EMBL HPBR1GENE) which codes for nitric 
oxide synthetase and which is thought to be important in mediating a variety of 
cellular responses, including vasodilation. Preferred polypeptides of the invention 
comprise the following amino acid sequence: MFKRHQRLKKDSTQAEEDLSEQ 
EQNQLNVLKKHGYVVGRVGRTFLYSEEQKDNIPFEFDADSLAFDMENDPVM 
GTHKSTKQVELTAQDVKDAHWFYDTPGITKENCILNLLTEKEVNIV IV -\ 

PRTf^LKPGMVLH^GAIGRIDr^QGNQS 

QKHAGHTLLQIPMGGKERMAGFPPLVAEDIMLKEGLGASEAVADIKFSSAG 
WVSVTPNFKDRLHLRGYTPEGTVLTVRPPLLPYIVNIKGQRrKKSVAYKTKKP 
PSLMYNVRKKKGKINV (SEQ ID NO: 344). Polynucleotides encoding such 
polypeptides are also provided. 
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A preferred polypeptide fragment of the invention comprises the following " 
amino acid sequence: MLPARLPFRLLSLFLRGSAPTAARHGLREPLLERRCAA 
ASSFQHSSSLGRELPYDPVDTEGFGEGGDMQERFLFPEYILDPEPQPTREKQL 
QELQQQQEEEERQRQQRREERRQQNTLRARSREHPVVGHPDPALPPSGVNCS 
GCGAXLHCQDAGWGYLPREKARTAEADGGLARTVCQRCWLLSHHRRALR 
LQVSREQYLELVSAALRXPGPSLVLYMVDLLDLPDALLPDLPALVGPKQLIV 
LGNKVDLLPQDAPGYRQRLRERLWEDCARAGLLLAPGTKGHSAPSRTSHR 
TGRIRIRRTGPAQWSC7TCG (SEQ ID NO: 345). Polynucleotides encoding these 
polypeptides are also provided. 

When tested against U937 cell lines, supernatants removed from cells 
containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates myeloid cells through the JAK- 
STAT signal transduction pathway. GAS is a promoter element found upstream of 
; many genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is 
a large, signal transduction pathway involved in the differentiation and proliferation 
of cells. Therefore, activation of the Jak-STAT pathway, reflected by the binding of 
the GAS element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

This gene is expressed primarily in early stage human brain, smooth muscle, 
and endometrial tumor and to a lesser extent in a variety of tissues representing many 
organs and developmental states;' , t 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cardiovascular, vascular, and neural diseases and/or disorders, 
particularly congestive heart disease and neurological disorders; Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or ceils, particularly of the circulatory and 
neural.systems, expression of this gene at significantly higher or lower levels is ' . - ' 
routinely detected in certain tissues or cell types (e.g., cardiovascular, vascular, 
neural, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma. 
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urine, amniotic fluid, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder,- relative to the standard gene 
* expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. * ' ' r ' 

4 Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 197 as residues: Phe-42 to Leu-48, Pro-53 to Asp-58, 
Pro-81 to Glu-123, Asp-256 to Trp-269, Gly-282 to Ser-306, Arg-333 to Gly-339, 
y Arg-403 to Gln-425, Ser-446 to Asn-452, His-475 to Gln-480, Gly-592 to Met-597, 
Pro-635 to His-642, Lys-667 to Lys-672, Lys-678 to Ser-684. Polynucleotides 
encoding said polypeptides are also provided. . 

The tissue distribution in smooth muscle and vascular tissues, combined with 
the homology to nitric oxide synthetase indicates that polynucleotides and 
polypeptides corresponding to this gene arc useful for diagnosis and treatment of ' 
congestive heart failure and neurological degenerative disorders, polynucleotides and 
^polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
"Hyperproiiferative Disorders" sections below/in Example 11.15; and 18, and 
elsewhere herein. Rriefly, the uses include, but are not limited to the detection, 
treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, . 
Huntington's Disease, Toiirette Syndrome, meningitis; encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder^ depression, panic disorder, * 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
norma! neural function. ; \ 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Moreover, the protein is useful in the detection, treatment, and/or prevention 
of a variety of vascular disorders and conditions, which include, but are not limited to 
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miscro vascular disease, vascular leak syndrome, aneurysm, stroke, embolism, 
thrombosis, coronary artery disease, arteriosclerosis, and/or atherosclerosis. 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate Iigands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. ■ . , . 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly ¥ . 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:84 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically , . ". 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2226 of SEQ ID NO:84, b is an 
integer of 15 to 2240. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:84, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

- The translation product of this gene shares sequence homology with the 
human KE04p, in addition to an unidentifed Celegans gene. 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 9-25 of the amino acid sequence referenced in 
Table 1 for this gene. Moreover, a cytoplasmic tail encompassing amino acids 1 to 8 
of this protein lias also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type If . 
membrane proteins/ . \'i ■ ■ * 

In another embodiment, polypeptides comprising die amino acid sequence of * 
the open reading frame upstream of the predicted signal peptide arc contemplated by 
the present invention. Specifically, polypeptides of the invention comprise. the 
following amino acid sequence: PSFRRERVETGGGGPVTI IGTEGPFLPLPGGTRM 
NMTQARVLVAAVVGLVAVLLYAS1HKJEEGHLAVYYRGGALLTSPSGPGYH 



WO 00/06698 



188 



PCT/US99rt7130 



IMLPniTFRSVQTTLQTDEVKNVPCGf SGGVMIYIDRIEVVNMLAPYAVFDIV 

RNYTADYDKTLIFNKJHHELNQFCSAHTLQEVYIELFDQIPENLKQALQKDL 

NLMAPGLTIQAVRVTKPKIPEAIRRNFELMEAEKTKLLIAAQKQKVV'EKEA 

ETERKKAVIEAEKIAQVAKJRFQQKVMEKETEKRISEIEDAAFLAREKAKA 

DAEYYAAliKYATSNKHKLTPEYIJELKKYQ^ 

ALKYSD IRTGRESS LPS KE ALEPSGEN VIQN KESTG (SEQ ID NO: 346). 
Polynucleotides encoding these polypeptides are aiso provided. 

The gene encoding the disclosed cDN A is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. 

This gene is expressed primarily in fetal tissue, including 8 week whole 
embryo, fetal liver spleen, nine week old early stage human, fetal heartfetal liver, 
fetal lung, and placenta and to a lesser extent in a variety of cancers, and other normal 
tissues. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but' are 
not limited to, cancer and diseases of tctal development. Similarly, ^polypeptides and 
antibodies directed to these polypeptides are useful in providing immunologic^ 
probes for differentia] identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the fetal tissues, especially the 
liver, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e:g., developmental, hepatic, immune, 
hematopoietic, pulmonary, cardiovascular, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or ceil sample taken from an individual having such a disorder, relative to the , 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 198 as residues: Leu-68 to Lys-74, Tyr-109 to Lys- 
115,Gln-200to Val-205, Lys-207 to Lys-214, Glu-237 to Ile-244, Ala-271 toThr- 
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279, Ser-3 17 to Ser-329, Gln-342 to Gly-348. Polynucleotides encoding said 
' polypeptides are also provided. 

. The tissue distribution of this gene (primarily fetal tissue and cancerous tissue, 
both of which are undergoing rapid growth) indicates that polynucleotides and * 
polypeptides corresponding to this gene are useful for treatment and diagnosis of . 
cancer and disorders of fetal development Moreover, the expression within fetal- 
tissue and other cellular sources marked by proliferating cells indicates this protein > ' 
may play a role in the regulation of cellular division, and may show utility in the 
diagnosis, treatment, and/or prevention of developmental diseases and disorders, 
including cancer, and other proliferative conditions. Representative uses are described 
in the "Hyperproliferative Disorders" and " Regeneration;' sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to* control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certuin 
neurodegenerative disorders, such as spinal muscular atrophy (SM A). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, *' 
' detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions arid diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine hiological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify, agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as. 
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antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. ' 

Many polynucleotide sequences, such as EST sequences, are publicly V 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:85 and may have been publicly-available prior to conception of 
the present invention. Preferably; such related polynucleotides are specifically . 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1474 of SEQ ID NO:85,.b is an ' 
integer of 15 to 1488. where both a and b correspond to the positions of nucleotide ' 
residues shown in SEQ ID NO:85, and where b is greater than or equal to a + 14. . 

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

When tested against U937 and Jurkat cell lines, supematants removed from 
cells containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates myeloid and T-cells, and to a lesser 
extent in other immune cells and tissue cell types, through the JAK-STAT signal 
transduction pathway. GAS is a promoter element found upstream of many genes * * 
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells..' 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

In another embodiment, polypeptides comprising the amino acid sequence of. 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: WSTGNASWEKKDNFILSADFEMMGLGNGRR 
SMKSPPLVLAALVAGIIVLGFNYWIASSRSVDLQTRIMELEGRVRRRAAERG 
AVELKJGXEFQGELEKQRRQLDK^^ 

ERLIRVLQDQLKfLQRNYGRLQQDVLQFQKJSrQTNLERK^SYDLSQCINQMI^ 
VKEQCEERICEVTKKGNEAVASRDLSENNDQRQQLQALSEPQPRLQAAGL 
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PHTEVPQGKGNVLGNSKSQTPAPSSEVVLDSKKQVEKEETNEIQVVNEE 
PQRDRLPQEPGREQVVEDRPVGGRGFGGAGELGQTPQVQAALXVSQENPE 
MEGPERDQLV1PDGQEEEQEAAGEGRNQQKLRGEDDYNMDENEAESETDKQ 
AALAGNDRNIDVFNVE DQKRDTINLLDQREKRNHTL (SEQID NO: 347). ■ 
5 Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
15 9. Accordingly, polynucleotides related to this invention are useful as a marker in 

linkage analysis for chromosome 9. 

This gene is expressed primarily in human endometrial tumor and other 
' 10 * tumors and to a lesser extent in a variety, of other healthy adult and fetal tissues 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or ceil lype(s) present in a 
- biological sample and for diagnosis of diseases and conditions which include, but are 

25 .not limited to, developmental diseases and/or disorders, particularly cancer and other 

15 proliferative disorders. Similarly, polypeptides and antibodies directed to these . 
polypeptides are useful in providing immunological probes for differential 
■ . identification of the tissuc(s) or cell type(s). For a number of disorders of the above 
( tissues or cells, particularly of the endometrial tissue, cervix and uterus, expression of 

this gene at significantly higher or lower levels is routinely detected in certain tissues 
20 or cell types (e.g., developmental, reproductive, and cancerous and wounded tissues) 
35 \\, ' or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 

tissue or cell sample taken from an individual having such a disorder, relative to the, 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
40 25 Preferred polypeptides of the present invention comprise imraurio genie 

epitopes shown in SEQ ID NO: 199 as residues: Asn-6 to Lys- 12, Leu-65 to Phe-70,. 
Glu-73 to His-88. GIn-123 toGln-135, Gin- 142 toLeu-156, Arg-173 toGly-181, 
Asp- 1 89 to Gin- 1 99, Scr-204 to Arg-209. Glu-2 1 9 to Gly-225, Gly-229 to Pi o-238, , 
Scr-246 to Asn-256, Glu-263 to Arg-276. Polynucleotides encoding said polypeptides 
30'f are also provided 

The tissue distribution in endometrial tissue indicates that polynucleotides and 
50 . polypeptides corresponding to this gene are useful for diagnosis and treatment of 
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endometrial, cervical and uterine cancer. Because of potential roles in proliferation 
' and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may" also act as a morphogen to control s 
cell and tissue type specification. Therefore, the polynucleotides and polypeptides of 
the present invention are useful in treating, delecting, and/or preventing said disorders 
and conditions, in addition to otlier types of degenerative conditions. Thus this protein 
: may modulate apoptos is or tissue differentiation and is useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions' and diseases. 
The protein is useful in modulating the immune response to aberrant polypeptides, as 
may exist in proliferating and cancerous celts and tissues. The protein can also be 
used to gain new insight into the regulation of cellular growth and proliferation. 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 86 arid. may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the. scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from die present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3 160 of SEQ ID NO: 86, b is an 
integer of 1 5 to 3 174* where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:86, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

The translation product of this gene shares sequence homology with protein 
disulfide isomerase from Acanthamoeba castelianii (See Genbank Locus 
ACAD1SPROA accession L28I74, genpep locus 456013) which isahought to be 
important in converting proteins into their native conformations. The protein product 
of this gene was also shown to have homology to a phosphoiipase C homologue 
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derived from a mast cell cDNA library (See Geneseq Accession No. R994) I). All 
references and information available through these accessions are hereby incorporated 
by reference herein - for example, Gene 150(1), 175-179 (1994). * 

Included in ,this invention as preferred domains are endoplasmic reticulum 
targeting sequence domain and the thioredoxin family active site domain, which were 
identified using the ProSite analysis' tool (Swiss Institute of Bioinformatics). Proteins 
that permanently reside in the lumen of the endoplasmic reticulum (ER) seem to be 
distinguished from newly synthesized secretory proteins by the presence of the C- 
terminal sequence Lys-Asp-Glu-Leu (KDEL) [1,2]. While KDEL is the preferred 
signal in many species, variants of that signal are used by different species. This 'v 
situation is described in the following table. * . 

SignaL Species--- - --—J. .„ „ . 

KDEL ' Vertebrates, Drosophila, Caertorhabditis elegans, plants 
HDEL Saccharomyces cerevisiae, Kluyveromyces lactis, plants ' * 

□DEL. Kluyveramyces lactis 

ADEL Schizosaccharomyces pombe {fission yeast) 

SDEL • Plasmodium falciparum 

The signal is usually very strictly conserved in major ER proteins but some 
minor ER proteins have divergent sequences (probably because efficient retention of 
these proteins is not crucial to the cell). Proteins bearing the KDEL-type signal are not. 
simply heid in the ER, but are selectively retrieved from a post-ER compartment by a 
receptor and returned to their normal location. The concensus pattern is as follows: 
[KRHQSAH DENQ!-E-L>. Thioredoxins are small proteins of approximately one 
hundred amino- acid residues which participate in various redox reactions via the 
reversible oxidation of an active center disulfide bond. They exist in either.a reduced 
form or an oxidized form where the two cysteine residues aire linked in an 
intramolecular disulfide bond. Thioredoxin is present in prokaryotes and cukaryotes 
and the sequence around the rcdox-active disulfide bond is well conserved. 
Bacteriophage T4 also encodes for a thioredoxin but its primary structure is not ' 
homologous to bacterial, pjant and vertebrate thioredoxins. A number of eukaryotic 
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proteins contain domains evolutionary related to thioredoxin, all of them seem to be 
protein disulphide isomerases (PDI). PDI (EC 5.3.4.1) is an endoplasmic reticulum 
enzyme that catalyzes the rearrangement of disulfide bonds in various proteins. The' 
various forms of PDI which are currently known are: - PDI major isozyme: a 
multifunctional protein that also function as the beta subunit of prolyl 4-hydroxylase 
(EG 1.14. 11 .2), as a component of oligosaccharyl transferase (EC 2.4.1.119), as 
thyroxine deiodinase (EC 3.8. 1.4), as glutathione-insulin transhydrogenase (EC 
1.8.4.2) and as a thyroid hormone-binding protein - ERp60 (ER-60; 58 Kd 
microsomal protein). ERp60 was originally thought to be a phosphoinositide-specific 
phospholipase C isozyme and later to be a protease. - ERp72. - P5. All PDI contains 
two or three (ERp72) copies of the thioredoxin domain. The concensus pattern is as ' 
follows: [LIVMF]-[LrVMSTAJ-x-[UVMFYC]-[rWSTHE]-x(2)-[FYWGTN]-C- 
[GATPLVEj-tPH YWSTAl-C-x(6)-[LIVMFYWT]. The two Cs form the redox- 
active bond. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: S LHRFVLS Q AKDEL (SEQ ID NO: 348), FIKFFAPWCGHCKALAPTW 
(SEQ ID NO: 349), and/or FIKFYAPWCGHCKTLAPTW (SEQ ID NO: 350). 
Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the endoplasmic reticulum 
targeting sequence domain and thioredoxin family active site domain of the sequence 
referenced in Table for this gene, and at least 5, 10, 15, 20. 25, 30, 50, or 75 
additional contiguous amino acid residues of this referenced sequence. The additional 
contiguous amino acid residues is N-terrninal or C- terminal to the endoplasmic 
reticulum targeting sequence domain and thioredoxin family active site domain. 
Alternatively, the additional contiguous amino acid residues is both N-terminal and 
C-terminal to the endoplasmic reticulum targeting sequence domain and thioredoxin 
family active site domain, wherein the total N- and C-terminal contiguous amino acid 
residues equal the specified number. Based on the sequence similarity, the translation 
product of this gene is expected to share at least some biological activities with 
thioredoxin proteins. Such activities are known in the an, some of which are 
described elsewhere herein. - , 
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tii another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: RRGRGVPGPRGRRRLWSAACGHCQRLQPTWN 
DLGDKYNSMEXAKVYVAKVDCh'AHSDVCSAQGVRGYPTLKLFKPGQEAV 
KYQGPRDFQTLENWMLQT1.NEEPVTPEPEVTEPPSAPELKQGLYE 
. VAQGDHHKFFAPWCGHCKALAPTWEQLALGLEHSETVKJGKVDCTQHY 
FJXSGNQVRGYPTLLWFRDGKKVDQYKGKRDLESLREYVESQLQRTETGA 
TETVTPSEAPVLAAEPEADKGTVLALTENNFDDTIAEGITFIKFYAPWCGHC 
KTLAFWEELSKKEFPGLAGVr^AEVDCT^ 

KVSEHSGGRDLDS LHRFVLSQAKDEL (SRQ m NO: 351). Polynucleotides * 
encoding these polypeptides are also provided. . . 

This gene is expressed primarily in human chondrosarcoma and endothelial 
cells and to a lesser extent in a wide range of normal and diseased adult and fetal 
tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. chondrosarcoma and other cancers and proliferative disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., vascular, skeletal, 
developmental, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, amniotic fluid, synovial fluid and spinal fluid) or another tissue or cell , t 
sample taken from an individual having such a disorder, relative to the standard gene ' 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. . * 

The tissue distribution in chondrosarcoma, combined with the homology to 
protein disulfide isomerase and phosphplipase C indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
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chondrosarcoma and other cancers and proliferative disorders, and possibly as a 
reagent for in vitro production of proteins. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely oh decisions involving cell differentiation 
and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogento control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in , 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular 'growth and proliferation. Moreover, the . 
expression in endothelial cells indicates the protein is useful in the detection, 
treatment, and/or prevention of a variety of vascular disorders and conditions, which 
include, but are not limited to rniscrovascular disease, vascular leak syncirome, 
aneurysm, stroke, embolism, thrombosis, coronary artery disease, arteriosclerosis, 
and/or atherosclerosis . Furthermore, the protein may also be used to determine . 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may* , 
show utility as a tumor marker ana/or immunotherapy targets for the above listed 
tissues: , . 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:87 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2766 of SEQ ID NO: 87, b is an 
integer of 15 to 2780, where both a and b correspond to the positions of nucleotide 
residues shown in SEQID NO:87, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

This gene is expressed primarily in thyroid and thymus 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases, and conditions which include, but are 
' not limited to, thyroid diseases including thyroid cancer and diseases of function 
including Grave's Disease, hyper- and hypo- thyroidism as well a's Diseases of the 
thymus. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine and immune systems, expression of this gene at 
significantly higher or lower levels is routinely delected in certain tissues or ceil types 
(e.g., endocrine, immune, and cancerous and wounded tissues) or bodily fluids (e.g 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in thyroid cells and tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of diseases of the thyroid and thymus. Representative uses arc 
described in the "Biological Activity", "Hyperproliferative Disorders", and "Binding 
Activity" sections below, in Example 1 U 17, 18, 19, 20 and 27, and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of 
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Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of the 
pancrease (e.g. diabetes me IHtus), adrenal conex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper- 
hypoparathyroidism) , hypothallamus, and testes: Furthermore, the protein may also 
be used to determine biological activity, to raise antibodies, as tissue markers, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or * * 

immunotherapy targets for the above listed tissues. - V 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases" Some of these sequences are 
related to SEQ ID NO:88 and may have been publicly available prior to conception of 
1 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence describee? by the general *. 
formula of a-b, where a is any integer between 1 to 1047 of SEQ ID NO:88, b is an 
integer of 15 to 106 U where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:88, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

The translation product of this gene shares sequence homology with collagen • 
which is thought to be important as a structural material in a variety of human tissues 
and products including hair, nails, muscle and bone. 

A preferred polypeptide fragment of the invention comprises the following, 
amino acid sequence: MRPQGPAASPQRLRGLLLLXLLQLPAPSS ASEIPKGKQK 
AHSGRGRWTCIMECAYkGQQECLV^TGALGPMAFRVHLGSQVGMDSKEK 
RGNV (SEQ ID NO: 352). Polynucleotides encoding these.polypeptides are also 
provided. " 

This gene is expressed primarily in smooth muscle and to a lesser extent in 12 
week old early stage human, epdidymus, healing groin wound, synovial hypoxia, i 
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stromal cells, ulcerative colitis, breast and 8 week old embryo, as well as a variety of 
other normal and diseased cell types from adult andfetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and other proliferativedisordcrs as well a's Diseases of smooth v 
muscle. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the muscular system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., vascular, 
developmental, reproductive, and cancerous and wounded tissues) or bodily .fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. \ ;,. ; . 

Preferred polypeptides of the present in vention comprise immunogenic 
epitopes shown in SEQ ID NO: 202 as residues: Glu-32 to Glu^S, Pro-63 to Ala-7 1, , . 
Pro-8i to Lys-90, Ser-97 to Trp-I 11, Lys-130 to Ser-135,Leu-147 to Cys-154, Asp- 
179 to Asn-186, Ser-219 to Gly-229. Polynucleotides encoding said polypeptides are 
also provided. • ' , 

The tissue distribution in smooth muscle and homology to collagen indicates * 
that polynucleotides and polypeptides corresponding to this gene are useful for 
trearment and diagnosis of diseases of vascular diseases and/or disorders. 
Representative uses are described in the "Biological Activity", "Hypcrproliferative > 
Disorders", "infectious disease", and "Regeneration'* sections below, in Example 11. 
19, and 20, and elsewhere herein. Briefly, the protein is useful in detecting, treating, 
and/or preventing congenital disorders (i.e. nevi, moles, freckles, Mongolian spots, 
hemangiomas, port r wine syndrome), integumentary tumors (i.e. keratoses, Bowen s 
Disease, basal cell carcinoma, squamous cell carcinoma, malignant melanoma, 
Paget's Disease, mycosis fungoides, and Kaposi's sarcoma), injuries and inflammation 1 
of the skin (i.e.wounds, rashes, prickly heat disorder,, psoriasis, dermatitis), 
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atherosclerosis, uticaria, eczema, photosensitivity, autoimmune disorders (i.e. lupus 
erythematosus, vitiligo, dcrmatorayositis. morphea, scleroderma, pemphigoid, and 
pemphigus), keloids, striae, erythema, petechiae, purpura, and xanthelasma. In 
addition, such disorders may predispose increased susceptibility to viral and bacterial * 
infections of the skin (i.e. cold sores, wans, chickenpbx, molluscum contagiosum, 
herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, althletes foot, and . 
ringworm). 

Moreover, the protein product of this. gene may also be useful for the 
treatment or diagnosis of various connective tissue disorders (i.e., arthritis, trauma, 
tendonitis, chrondomalacia and inflammation, etc.). autoimmune disorders (i.e., 
rheumatoid arthritis, lupus, scleroderma, dermatornyosiu's, etc.), dwarfism, spinal 
deformation, joint abnormalities, amd chondrodysplasias (i.e. spondyloepiphyseal 
dysplasia congenita, familial osteoarthritis, Atelosteogenesis type II/metaphyseaJ ' 
chondrodysplasia type Schrnid). Moreover, the. protein is useful in the detection, 
treatment, and/or prevention of a variety of vascular disorders and conditions, which 
include, but are not limited to miscro vascular disease, vascular leak syndrome, 
aneurysm, stroke, embolism, thrombosis, coronary artery disease, arteriosclerosis, 
and/or atherosclerosis. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use asa ' . 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. , ' 1 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 89 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is. - 
- cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1328 of SEQ ID NO:89, b is an 
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integer of 15 to 1 342, where both a and b correspond to the positions of nucleotide ' 
residues shown in SEQ ID NO: 89. and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

5 This gene is expressed primarily in immune cells and to a lesser extent in a 

wide variety of human tissues. 
15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or celt type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include but are 
20 10 not limited to, T cell or B cell leukemia and various immunodeficiencies. Similarly, 

polypeptides and antibodies directed to these polypeptides are useful in providing ■ 
immunological probes for differential identification of the tissue(s) or.cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
25 ; expression of this gene at significantly higher or lower levels; is routinely detected in 

15 certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual hot having the disorder. 
20 Preferred polypeptides of the present invention comprise immunogenic 

35 . epitopes shown in SEQ ID NO: 203 as residues; GIy-3 to Gln-9. Polynucleotides 

encoding said polypeptides are also provided. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
40 25 immune system diseases such as immunodeficiencies and T cell and/or B cell 

leukemia. Representative uses are described iq the "Immune Activity" and "infectious 
disease" sections below, in Example 11,13, 14, 16, 18, 19, 20, and 27, and elsewhere 
45 herein. Briefly, the expression of this gene product indicates a role in regulating the 

.. ■ ' proliferation: survival; differentiation; and^or activation of hem atopoictic cell 

30 lineages, including hjood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
50 ifl the treatment of cancer (e.g. by boosting immune responses). 
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Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-celi mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-yersus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyeiihation, systemic 
lupus ery the matosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's • 
Disease, and scleroderma. Moreover, the protein may. represent a secreted factor that 
influences the differentiation or behavior of other hfood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences,.are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:90 and may have been publicly available prior to conception of - 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention.' to list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more poly nucieotidescomprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer, between 1 to 756 of SEQ ID NO:90, b is an 
integer of 15 to 770, where both a and b correspond to the positions of nucleotide 
residues, shown in SEQ ID NO:90, and where ivis greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCQDED BY GENE NO: 81 

The translation product of this gene shares sequence homology with IgE 
receptor. See for example. Isolation and Characterization of cDNAs coding for the 
Beta Subunk of the High-affinity Receptor for Immunoglobulin E, Proc. Natl. Acad. 
Sci. U S A. (1988 Sep.) 85(17): 6483-6487' Based on the sequence similarity, the 
translation product of this gene is expected to share at least some biological activities 
with IgE receptor proteins. Such activities are known in the art. some of which are 
described elsewhere herein. IgE and its receptors are believed to have evolved as a 
mechanism to protect mammals against parasites. But other and intrinsically 
innocuous antigens can subvert this system to provoke an allergic response. For 
human populations in industrialized countries, allergy and asthma now represent a far 
greater threat than paras iuc infection, and the main impetus for current studies of the 
IgE system is the hope of understanding. and intervening in the aetiology of allergic 
diseases. The high-affinity receptor for immunoglobulin (Ig) E (Fc epsilon RI) on 
mast cells and basophils plays a key role in IgE- mediated allergies. Fc epsilon RI is 
composed of one alpha, one beta, and two gamma chains, which are all required for 
cell surface expression of Pc epsilon RI, but only the alpha chain is involved in the 
binding to IgE. Fc epsilon RI-IgE interaction is highly species specific, and rodent Fc 
epsilon RI does not bind human IgE. New homolog can be used to develop ant- 
allergic agents. FcR deliver signals. when they arc aggregated at the cell surface. The 
aggregation of FcR having irnmunorecepLor tyrosine-based activation motifs (ITAMs) 
activates sequentially sre family tyrosine kinases and syk family tyrosine kinases that 
connect transduced signals to common activation' pathways shared with other 
receptors. FcR with ITAMs elicit cell activation, endocytosis, and phagocytosis. The 
nature of responses depends primarily on the cell type. The aggregation of FcR 
without IT AM does not trigger cell activation. Most of these FcR internalize their 
ligahds, which can be endocytosed, phagocytosed, or transcy tosed. The fate of 
internalized rcccptor-ligand complexes depends on defined sequences in the 
intracytoplasmic domain of the receptors. The coaggregation of different FcR results 
in positive or negative cooperation! Some FcR without ITAM use FcR with ITAM as 
signal transduction subunits. The coaggregation of antigen receptors or of FcR having 
ITAMs with FcR having immunoreceptor tyrosine-based inhibition motifs (ITIMs) 
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negatively regulates cell activation. FcR therefore appear as the subunits of, 
multichain receptors whose constitution is not predetermined and which deliver 
adaptauve messages as a function of the environment: 

The polypeptide of this gene has been aeterrnirted to have four transmembrane 
domains at about amino acid position 5 1 - 67, 89 - 105, 1 19 - 135, and 190 - 206 of 
the amino acid sequence referenced in Table I for this gene. Based upon these 
characteristics, it lis believed that the protein product of this gene shares structural 
features to type Ola membrane ; prote?ns. f . 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: ETR V KTS LELLRTQLEPTGT VGNTIMTS QPVPN 
ETIIVLPSNVINFSQAEKPEPTNQGQDSLKKH^ 

GnLASASFSPNr^QVTSTLLNSAYPFIGPFFraSGSLSIATEKRLTKLLVHSSLV 

GSILSALSALVGFIILSVKQATLNPASLQ^ 

DCYTAKASI^GXLSLMUCn. 

S YIGNSGMSSKMTI1DCGYEELLTS (SEQ ID NO: 353). Polynucleotides encoding - 
these polypeptides are also provided. 

• A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MMVLSLGULASASFSPNFTQVTSTLLNSAYPFIGPFFFI 
ISGSLSIATCKI^TKXLVHSSLVGSILSALSALVGniLSVKQATLNPASLQC 
ELDKNNIPTRSYVSYFYHDSLYTTDCYTArCASLAGXLSLMLICTLLEFCL 
AVLTAVLRWKQAYSDFPGSVLFLPHSYIGNSGMSSKMTHDCGYEELLTS v 
(SEQ ID NO: 354). Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
1: Accordingly, polynucleotides related to this invention are useful as a marker in 
, linkage analysis for chromosome 1 . 

. This gene is expressed primarily in immune system tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differentiai identification of the tissue(s) or cell type(s) present, in a ' 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune system diseases and/or disorders such as cancer. Similarly, 
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polypeptides and antibodies directed to these polypeptides are useful in providing 
* immunological probes for differential identification of the tissue (s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., immune, hematopoietic, arid cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma; urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. ■ • ; " ■ \. , 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 204 as residues: Gln-23 to Lys-39. GIu-150 to Thr- 
L58 " Polynucleotides encoding said polypeptides are also provided. * 

The tissue distribution in immune cells and tissues combined with the* V ; 
homology to IgE receptor indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of immune system 
disorders. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 1 1, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 
lineages, including blood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). ■ 1 ■ ■ 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma , immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
.bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, . 
such as T-ccll mediated cytotoxicity; immune reactions to transplanted organs and • 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia^ rheumatoid arthritis, Sjogren's ' 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that " 
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influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be 'useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. The gene product may 
also be involved in lymphopoiesis, therefore, it can be used in immune disorders such 
as, infection, inflammation, allergy, immunodeficiency etc. In addition, this gene 
product may have commercial utility iri the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional* 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:91 and may have been publicly available prior to conception of 
the present invention- Preferably, such related polynucleotides are specifically 
excluded from the scope, of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b T where a is any integer between 1 to 1556 of SEQ ID NO:9 1, b is an 
integer of 15 to 1570, where both a and b correspond to the positions of nucleotide ' 
residues shown in SEQ ID NO:91, and where. b is greater than or equal to a + 14, 

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GASCEGGGAAARAALGVHRSQKALLVFRRTL 
SNLLYMPLLRGLLWLQVLCAGPLHTEAVVLLVPSDDGRAFLLRSRLLHPEAH 
VPPAADRGASLQCVLHQAAPKSRPRSPAAGAALLHXPRRTGDEPCREFHGN 
GFPG PJQLTPGECGLPA PS S LLQ1 1 AS AP VRTGS EGQ V VGCPR A R GETGEGLS L 
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APLSSLMFTSRNGLVGC GASCEGGGAAARAALGVHRSQKALLVFRRTLSNL 
LYMPLLRGLL^QVLCAGPLHTEAVVLLVPSDDGRAFLLRSRLLHPEAHVPP 
AADRGASLQCVLHQAAPKSRPRSPAAGAALLHXPRRTGDEPCREFHGNGFP 
GPTQLTPGECGLPAPSSLLQHASAPVRTGSEGQVVGCPRARGETGEGLSLA 
. rLSvSLMFTSRNGLVGC (SEQ ID NO: 355). Polynucleotides encoding these 
polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
7. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 7. ■ . 

This gene is expressed primarily in activated T cells, and to a lesser extent in a 
wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the u'ssue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are r 
not limited to, immune and hematopoietic diseases arid/or disorders, particularly 
immunodeficeincies. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
, identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at : 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 205 as residues: Pro-67 to Ser-73. Polynucleotides 
encoding said polypeptides are also provided 

The tissue distribution in activated T cells indicates that polynucleotides and . 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immunodeficiencies. Representative uses are described in the "Immune Activity" and 
"infectious disease" sections below, in Example 1 1, 13, 14, 16/18, 19, 20, arid 27, and 
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elsewhere herein. Briefly, the expression of this gene product indicates a' role in ■ ~ 
. regulating the proliferation; survival; differentiation; and/or activation of - 
hematopoietic cell lineages, including blood stem cells. This gene product is involved 
in the regulation of cytokine production, antigen presentation, or other processes 
suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is invol ved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as ArpS, leukemia, rheumatoid' arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-celi mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- vcrsus^host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, dcmyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligarids or receptors, to identify agents that modulate their 
interactions, tri' addition to its use as a nutritional supplement: Proiein, as well as, t . 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly^ ' 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 92 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically ; 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by die general 
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formula of Ji-b, where a is any integer between I to 2936 of SEQ ID NO:92, b is an ' 
integer of 15 to 2950, where both a and b correspond to the positions of nucleotide 
■ residues shown in SEQ ID NO:92. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

The translation product of this gene was shown to have homology to the 
human transmembrane protein (See Genbank Accession No; gbl AAC5 1 364. 1 1 
(AF0OO959): all references and information available through this accession are 
hereby incorporated by reference herein; for example; Genomics 42 (2), 245-25 1 
(1997)) which is thought to be implicated in velo-cardio-facial syndrome. 

A preferred polypeptide fragment of the invention comprises the following- 
ammo acid sequence: MGSAALErLGLVLCLVGVVGGLlLACGLPMWQVTAFLD 
HNIVTAQTT^'KGLWMSCVVQSTGTCSAKCTTRCWL (SEQ ID NO: 356). " 
Polynucleotides encoding these polypeptides arc also provided- 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
22. Accordingly, polynucleotides related to this invention are useful as a marker in 
1 linkage analysis for chromosome 22. . 

This gene is expressed primarily in dementia brain tissue, and to a lesser 
extent in a wide variety of human tissues. . 

Therefore, polynucleotides and polypeptides of the invention are useful as * 
reagents for differential identification of the lissue(s) or cell type(s) present in a * 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural diseases and/or disorders, particularly dementia. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the central nervous 
system, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., neural, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level; i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 206 as residues: Ser-201 to Tyr-217. Polynucleotides 
encoding said polypeptides are also provided. 

■ The tissue distribution in dementia brain tissue, combined with the homology * 
to the transmembrane proteinindicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of dementia, and 
potentially for velo-cardio-facial syndrome. Representative uses are described in the 
"Regeneration" and "Hyperprol iterative Disorders" sections below, in Example 11, 
15, and 18, and elsewhere herein. Briefly, the uses include, but arc not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries,' ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression! panic disorder, 
learning disabilities, ALS. psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies , as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as ES T sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:93 and may have been publicly available prior to conception of 
the present inventioa Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides 1 comprising a nucleotide sequence described by the general 
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fo|muIa ofa-b t where a is any integer between 1 to 1708 of SEQ ID NO:93', b is an 
integer of 1 5 to 1722, where both a and b correspond to, the positions of nucleotide 
residues shown in SEQ ID NO:93, and where b is greater than or equal to a + 14. 

TSATURiES OF PROTEIN ENCODED BY GENE NO: 84 

In another embodiment, polypeptides comprising the amino acid sequence of 
ihe open reading frame upsLreamof the predicted signal peptide are contemplated by , 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: LKRAPPGPALAKGLLQPSSTFQALETNIGDQVR 
RHSTAVVrRENTfSYTLISFVLLIGVG 

VRMLRGELSCACYRPHVQALQLGGCTCFYSRQ TO NO: 357). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in the adult pulmonary system. 

Therefore, polynucleotides and polypeptides of me invention are us ' 
reagents for differential identification of the tissue(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cystic fibrosis, bronchitis and any pulmonary disorders in general. • 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or ceil , 
type(s). For a number of disorders* of the above tissues or cells, particularly of the * " 
pulmonary system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., pulmonary, cardiovascular, and 
cancerous and wounded tissues) or hodily fluids (e.g., serum, plasma, urine, . v : 1,1 
pulmonary surfactant, pulmonary lavage/sputum, synovial fluid and spinal fluid) or 
another tissue or cell sample taken fruin an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. f 

The tissue distribution of this gene only in the pulmonary sysLem indicates that 
it plays a key role in the functioning of the pulmonary system. This would suggest 
that misregulation of the expression of this protein product in the aduit could lead to 
lymphoma or sarcoma formation, particularly in the lung and the protein product 
could be used either in the treatment and/or detection of these disease, states. The gene 
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or gene product may also useful in the treatment and/or detection of pulmonary , 
defects such as pulmonary edema and embolism, bronchitis and cystic fibrosis. 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate Hgands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues . 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases /Some of these sequences are 
related to SEQ ID NO:94 and may have been publicly "available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from ihe scope of the present invention; to list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 62 1 of SEQ ID NO:94, b is an 
integer of 15 to 635, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:94, and where b is greater than or equal to. a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this gene was found to be homologous to CAM 
proteins. Based on the sequence similarity, trie translation product of this gene; is 
expected to share at least some biological activities with CAM proteins. Such , 
activities are known in the art, some of which are described elsewhere herein.* 

A preferred polypeptide varient of the invention ^comprises the following 
amino acid sequence: MLCPWRTA^GLLLILTIFLVAEAEGAAQPNNSLM - 
LQTSKENH AL ASSSLCMDEKQlt QN YS K VLAE VNTS WP V KM ATN A VLC 
CPPIALRNLIIITV^nU^GQPSCTKAYKKETNETKEJN 

NSDLQIRTVArrHDCYYRCIMVTPDGNFHRGYHLQVLVTPEVTLFQNRNRTA 

VCKAVAGKPAAHISWIPEGDCATKQEYWSNGTVTVKSTCHWEVHNVSTV 

NCHVSHLTGNKSLYIEIXPVPGAKXSSKLYIPYn 

CXKYKLNKPESTPVVEEDEMQPYAFYTEKNNPLXXTTNKVKASEALQSEV 
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DTDLHTL (SEQ ID NO:203). Polynucleotides encoding these polypeptides are also 
provided. 

The polypeptide of this gene has been. determined to have a transmembrane 
domain at about amino acid position 271 - 287 of the amino acid sequence referenced 
in Table 1 for this gene. Moreover, a cytoplasmic tail encompassing amino acids 288 
to 348 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type la 
membrane proteins. ,< 

This gene is expressed primarily in dendritic cells. , ,r 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immunodeficiency, tumor necrosis, infection, lymphomas, auto- 
immunities, cancer, metastasis, wound healing, inflammation, anemias 
(leukemia) and other hematopoeitic disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell typeifs). For a number of disorders of 
the above tissues or cells, panicularly of the immune system; expression of this gene 
at significantly higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., immune, hematopoietic,, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 208 as residues: Asp-53 to Tyr-6 1 , Pro- 105 to Ile- 
128. Arg-133 to Leu-140, Gln-182 to Ala-188, Pro-205 to Asn-218, Gly-259 to Xla- 
264, Asn-290 to Ser-302, GIu-307 to.Tyr-314, Jyr-317 to Lys-332. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in dendritic cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
immune disorders including: leukemias, lymphomas, auto-immunities, 
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immunodeficiencies (e.g. AIDS), immuno-supressive conditions (transplantation) and 
hematopoeitic disorders. In addition this gene product is applicable in conditions of 
general microbial infection, inflammation or cancer. Representative uses are, 
described in the "Immune Activity" and "infectious disease" sections below, in 
Example 11, 13, 14, 16, 18/ 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
- blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation! or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 1 

Since the gene is expressed in cells of lymphoid origin, the natural gene * 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
.bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immuneTeactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor thai 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic 'cells to sites of injury. Thus, this gene product is thought to be useful in . 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to ctetermine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their v 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. V 

Many polynucleotide sequences, such as EST sequences, are publicly- 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:95 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every relaxed sequence is 
10 ' cumbersome. Accordingly, preferably excluded from the present invention are one or 

more polynucleotides comprising a nucleotide sequence described by the general 
5 formula of a-b\ where a is any integer between I to 3784 of SEQ ID NO:95, b is an 
integer of 1 5 to 3798, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:95, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 86 

In another embodiment, polypeptides comprising the amino acid sequenced \ 
the open reading frame upstream of the predicted signal peptide arc contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following .ami no acid sequence: VIKLICPAArTVYFQDiMARGCVCSLCASVCIFLS 
25 SLFPLLPSVHSVNIISCLLLSKCFEGLELMCEHL YQLSQLHVLHHIFSYLLCTP 

15 (SEQ ID NO: 358). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primariiy in embryonic tissue and to a lesser extent in 
in a variety of other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
\ • 20 biological sample and for diagnosis of diseases and conditions which include, but are 

35 1101 umi ted to, developmental anomalies, fetal deficiencies, cancer and neoplastic 

. states. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or 
cell type(s). For a number of disorders of the above tissues or cells, particularly of the 
developing fetus .expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., developmental, differentiating, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
45 . . * , amniotic fluid, synovial fluid and spinal fluid) or another tissue or cell sample taken . 

from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily.fluid from an individual not 
having the disorder. 
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The tissue distribution in embryonic tissue indicates that poly nucleotides and 
polypeptides corresponding to this gene arc useful for the diagnosis and treatment of 
developmental anomalies, fetal deficiencies and pre-natal disorders, as well as • * 
abnormal cell proliferation and/or differentiation, neoplastic states and cancer. 
Moreover, the expression within embryonic tissue and other cellular sources marked 
by proliferating cells indicates this protein may play a role in the regulation of cellular 
division, and may show utility in the diagnosis, treatment, and/or prevention of 
developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern ' 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
. neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders arid conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating arid cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/pr 
immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:96 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2669 of SEQ ID NO:96, b is an 
integer of 15 to 2683, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:96, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BV GENE NO: 87 

The translation, product of this gene shares sequence homology with inter- 
alpha-trypsin inhibitor which is thought to be important in inhibition of trypsin and. . 
other serine proteases (See Genbank Accession No, pir!S30350IS3O350; all references 
and information available through this accession are hereby incorporated herein by 
reference; for example, Eur. J. Biochem. 179 (1). 147-154 (1989), J. Biol. Chem. 264 
(27), !5975-!598I (1989). and J. Biol. Chem. 266 (2) ,'747-75 1 (1991)). 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability, of the plasma membrane of THP- 1 cells to 
calcium. Thus it is likely that the product of this gene is involved in a signal ' 
transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of both monocytes, in addition to other cell-lines or tissue 
cell types. Thus, polynucleotides and polypeptides have uses which include, but are 
not limited to, activating monocytes, and to a lesser extent, other immune and/or 
hematopoietic cells. Binding of a ligand to a receptor is known to alter Intracellular 
levels of small molecules, such as calcium, potassium and sodium, as well as alter pH 
and membrane potential. Alterations in small molecule concentration can be measured 
to identify supematahts which bind to receptors of a particular cell. k , 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
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following amino acid sequence: YXIPGSTHASGRQRGSGRGEDDSGPPPSTVINQ 
NETFANIIFKPTVVQO/VRUQNGU-GDniRYDVNREQSIGDIQVLNGYFV'HYF 
APKX»LPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSUGFS 
MRIKVWKDHLISVTPDSIRDGKVYIHHM 

SGIGDRSVSLIVFLTDG KPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVD 

FRLLEKLSLENCOLTRR VHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPSS VVQ 

ATKTLFPNYFNGSEmAGKLVDRKLDHLHVEVTASNSKKFITLKTDVPVRPQK 

AGKI>VTGSPRPGGDGEGDXNHIERLWSYLtTKELLSSWLQSDDEPEKERLRQ 

RAQALAVSYRFLTPFTSMKLRGPVPRMDGLEEAHGMSAAMGPEPVVQSVR 

GAGTQPGPLLKKPYQPRIKISKTSVnGDPHFVVDFPLSRLTVCFNIDGQPGpiL 

RLVSDHRDSG\nrVNGELIGAPAPPNGHKKQRTYLRTnUINKPERSYLEltPS 

RVILDGGDRLVLPCNQSWVGSWGLEVSVSANANVTVTIQGSIAFVILiHLYK 

KPAPFQRHHLGFYIANSEGLSSNCHGLi-GQI^NQDARLTEDPAGPSQNLTHP 

LLLQVGEGPEAVLTVKGHQVPVVWKQRK3YN GEEQX DCWFARNMPPN 

(SEQ ID NO: 359). Polynucleotides encoding chese polypeptides are also provided. 

This gene is expressed primarily in placenta and adipose tissue and to a lesser 
extent in several other organs and tissues including cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include; but are 
not limited to, disorders of developing organs and metabolic diseases, in addition to 
vascular diseases and conditions. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the developing systems and metabolic systems, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, vascular, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, t i,c., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID'n6: 210 as residues: Lys-5 to Lys- 1 0, Asn-33 toLys-39, 
Asp-48 to Lys-54, Pro-62 to Asp-67, Asn-116 to Arg-123, His-157 to Ala-162, Val- 
242 to Lys-249, Val-25 1 to Asp-264. Polynucleotides encoding said polypeptides are 
also provided. ( '■ * 

The tissue distribution in placenta, combined with the homology to inter- 
alpha-trypsin inhibitor and the detected calcium flux biological acttivity indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treatment 
and diagnosis of disorders of developing and metabolic systems. This protein may 
play a role in the regulation of cellular division, and may show utility in the diagnosis, 
treatment; and/or prevention of developmental diseases and disorders, including 
cancer, and other proliferative conditions. Representative uses are described in the 
"Hyperproiiferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 
and/or apbptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphbgen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Moreover, the protein 
is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but arc not limited to miscrovascuiar disease, 
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vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. Polynucleotides and polypeptides of 
the invention are also useful for the treatment, detection, and/or prevention of 
inflammation, tumor invasion and metastasis, wound healing, liver disease, 
disseminated intravascular coagulation, alzheimcr's Disease, ophthalmic disease, 

, apoptosis, ussue remodeling, intrauterine growth-retardation, preeclampsia, 
angiogeriesis, cell migration, fetal development, trophoblast implantation, ovulation, 

* pemphigus and psoriasis, and antiviral therapy. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed lissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NQ:97 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2 167 of SEQ ID NO:97, b is an 
integer of 15 to 2181, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:97, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

The translation, product of this gene was shown to have homology to the 
human colon carcinoma antigen NY-CO-7 (See Genbank and Gcneseq Accession 
Nos. gblAACI 8038. 11 (AF039689) and WO9904265; all references available through 
this accession are hereby incorporated herein by reference; for example, Int. J. Cancer 
76 (5), 652-658 (1998)). 

This gene is expressed primarily in breast and breast cancer and to a lesser .. 
extent in several other organs and tissues including cancers. 
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Therefore, polynucleotides arid polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of reproductive organs and the gastrointestinal system, 
including cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunu logical probes for differential 
identification of the tissue(s ) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the reproductive systems, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., gastrointestinal, reproductive, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, breast milk, chyme, bile, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention compriseirnrnunogenic 
epitopes shown in SEQ ID NO: 21 1 as residues: Gly-22 to Gly-28 t Leu-71 to Phe-77, 
Asn-101 to Val-108, Pro-122 to Ser-127, ArgW49 to Pro-154, Gly-191 to Phe-196, 
Pro-199 to Thr-21 1. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in breast and hreast cancer tissue, combined, with the 
homology lo a colon cancer antigen indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for 1 treatment and diagnosis of disorders of the 
reproductive systems and cancers. This protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis, treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration " sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
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potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, ' 
detecting, ancVor preventing said disorders and conditions, in addition to other types 
of degenerative conditions'. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in trie detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides^ as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the * 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly , 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:98 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1943 of SEQ ID NO:98, b is an 
integer of 15 to 1957, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:98, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 < 

The translation product of this gene shares sequence homology with the amino 
acid and protein sequence of a Xenopus transmembrane protein of unknown function. 
The very 5'-end of the contig is identical to the mRNA for the human LGN mosaic 
protein. Based on the sequence similarity, the translation product of this gene is 
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expected to share at least some biological activities with LCN mosaic proteins. Such 
activities are known in the art; some of which are described elsewhere herein. 
Preferred polypeptides of the invention comprise the following amino acid sequence: 
PRVRPPTKALAVTPTTFVra 

AiLSFCYGAGKSVHVLRHIGGPEI^PPQALRPRDRRRQEEIDYRPDGGAGDAD 
FH YRGQMGPTEQGP YA KTYEGRREILRERD VDLRFQTGNKSPEVLRAFDVPD 
AEAREHPTV VPSHKSPVLDTKPKETGGILGEGTPKESSTESSQS AKPVSGQDtS 
GNTEGSPAAEKAQLKSEAAGSPDQGSTYSPARGVAGPRGQDPVSSPCG (SEQ 
ID NO:339). Polynucleotides encoding such polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
1 . Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome I . 

This gene is expressed primarily in small intestine and adipocytes and to a 
lesser extent in various other normal and transformed cell types, mostly of endocrine 
origin. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and. for diagnosis of diseases and conditions which include, but are 
not limited to, conditions of growth and metabolism. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the digestive and endocrine 
systems, expression of this gene at significantly higher or lower levels is routinely/ 
detected in certain tissues or cell types (e.g., metabolic, gastrointestinal, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, bile; 
chyme, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 1 • ( ; 

Preferred polypeptides of the present invention comprise immunogenic ' 
epitopes shown in SEQ ID NO: 212 as residues: Pro-40 to Gly-68, Gly-79 to Arg-93, 
Phe-106 toGlu-114, Pro-122 to.His-129, Thr-143 to GIy-149, GIy-155 to Ala-168, 
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Val-171 to GIy-182, Aia-195 to Pro-207, Pro-214 to Val-220. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in small intestine indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for study and treatment of 
disorders of growth and metabolism as well as endocrine abnormalities. Furthermore, 
the protein may also be used to determine biological activity, to raise antibodies, as 
tissue markers, to isolate cognate ligands or receptors, to identify agents that modulate 
their interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets' for the above listed tissues. ' 

. Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:99 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general . 
formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:99, b is an 
integer of 15 to 1 1 12, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:99, and where b is greater than or equal to a 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

The translation product of this gene shares sequence homology with IgE 
receptor beta chain which is thought to be important in immune function. 
This gene is expressed primarily in kidney medulla tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and renal diseases and/or disorders. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of ' 
disorders of the above tissues or cells, particularly of the immune and renal systems, 
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expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or ceil types (e.g., immune, renal, urogenital, and cancerous and 
wounded tissues) or bodily fluids (e.g!, lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such 
a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in kidney renal medulla tissue, combined with the 
5 homology to the IgE receptor beta chain indicates that polynucleotides and 
polypeptides corresponding to this gene arc useful for the treatment of immune and 
renal disorders. The protein product of this gene could be used in the treatment and/or 
detection of kidney diseases including renal failure, nephrirus, renal tubular acidosis, 
proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilms Tumor Disease, and congenital kidney abnormalities such as horseshoe 
kidney, polycystic kidney, and Falconi's syndrome. Alternatively, this gene product is 
involved in the regulation of cytokine production, antigen presentation, or other 
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune £ 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthrius, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia; psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthrius, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermorei the 
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protein may also be used to determine biological activity, raise antibodies, as tissue • 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

\iany polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 00 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general ' • 
formula of a-b, where a is any integer between 1 to 873 of SEQ ID NO: 100, b is an 
integer of 15 to 887, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 100, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

The translation product of this gene shares sequence homology with Diff 40 
gene product (See Genbank Accession No. ghlAAC5l 134. 1 ; all references and 
information available through this reference are hereby incorporated herein). 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: PRVRSIKVTELKGLANHVVVGSVSCETKDLFAALPQVVAVDIN 
DLGTIKLSLEVTWSProKDDQPSAASSVNI^ 

LREQAFYNMLRRQEELENGTAWSLSSESSDDSSSPQLSGTARHSPAPRPLV 

QQPEPLPIQVAFRRPETPSSGPLDEEGAVAPVLANGIiAPY 

ALAEASVEAVGPKSLSWGPSPFfrU'APTHGKHPSPVP^ 

LGTTGS VPTSTDP A PS A HLDS VHKSTDS GPSELPGPTHTTTGST YS A ITTTHS 

APSPLTHTTTGSTHfO>IIS^ > 

YVDFCSSVCDNIFVHY\TGIFFHTLYSSKTL (SEQ ID NO:360), and/or PRVRS 

IKVTELKGLANHVVVGSVSCETKT>LFAALPQVVAVDmDLGTIKLSLEVTWSP 
FDKDDQPSAASSVNKASTVTKRFSTYSQSPPDiTC^ 
NGTAWSLSSESSDDSSSPQLSGTARHSPAP RPLVQQPEPLPIQVAFRRPET 
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PSSGPLDEEGAVAPVLANGHAPYSRTLSHISEASVNAALAEASVEAVGPKSL 
SWGPSPPTHPAPTHGKHPSPVPPALDPGHSATSSTLGTTGSVPTSTD (SEQ ID 
. NO: 361). Polynucleotides encoding these polypeptides are also provided. 
Polypeptides of the invention do not consist of the primary amino acid sequence 
shown as Geneseq Accession NO1W69430, which is hereby incorporated herein by 
reference. ,• 

This gene is expressed primarily in liver and to a lesser extent in gall bladder 
tissue. . ■ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic and endocrine diseases and/or disorders, particularly hepatic 
and gall bladder disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the metabolic and endocrine systems, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., hepatic, metabolic, gall bladder, gastrointestinal, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, bile, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to Lhe standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not. having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO': 214 as residues: Val-9 to Cys-14, Pro-42 to Thr-47, 
Thr-56 to Ala-64, Asp-88 to His-98, Cys-128 to Ser-136, Arg-153 to Trp-i61. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in liver and gall bladder, combined with the homology 
to the diff 40 gene product indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the study and treatment of endocrine and 
metabolic disorders, polynucleotides and polypeptides corresponding to this gene are 
useful for the detection and treatment of liver disorders and cancers. Representative 
uses are described in the "HyperproUferative Disorders", "infectious disease", and 
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"Binding Activity" sections below, in Example 1 I, and 27, and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of 
^ hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and conditions that arc ' 
attributable to the differentiation of hepatocyte progenitor cells. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognale ligands or receptors, to identify agents that modulate their 
interactions; in addition to its use, as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or > 
immunotherapy tai gets for the above listed tissues... 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 101 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is . 
cumbersome, Accordingly, preferably excluded from the present. invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1234 of SEQ ID NO: 101, b is an 
integer of 15 to 1 248, where both a and h correspond to the positions of nucleotide 
residues shown in SEQ ID NO; 10 1 . and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The polypeptide of this gene has been determined to have a trammembrane 
domain at about amino acid position 3 - 19 of the amino acid sequence referenced in 
Table 1 for this gene. Based upon these characteristics, it is believed that the protein 
product of this gene shares structural features to type II membrane proteins. 

, This gene is expressed primarily in fetal brain and to a lesser extent in 
pancreas tumor, melanocyte and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissiie(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but axe , 
not limited to, neural diseases and/or disorders, particularly neurodevelopmental 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
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useful in providing immunological probes for differential identification of the 
tissue(s) or cell : typc(.s). For a number of disorders of the above tissues or cells. 
10 " particularly of the central nervous system/ expressi^ of this gene 

. . . . v higher or lower levels is routinely detected in pertain tissues or cell types (e.g., neural, 

5 developmental, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, ■ • 

serum, plasma, amniotic fluid, urine, synovia] fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue of bodily 
fluid from an individual not having the disorder. 

The tissue distribution in fetal brain tissue indicates that polynucleotides and " 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
developmental disorders of the central nervous* system.' Representative uses are ' " 
*- described in the "Regeneration " and "Hyperproliferative Disorders" sections below, in 
25 Example 1 1,15, and 18, and elsewhere herein. Briefly, the uses include, but are not 

15 Umited to the detection,. treatment, and/or prevention of Alzheimer's Disease, * 
" - Pa riunson's Disease, Huntington s Disease, Tourette Syndrome, meningitis/ " 

encephalitis, demyciinating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, ' 
hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 
disorder, depression, panic disorder, learning disabilities, ALS, psychoses, autism,. 
35 + , and altered behaviors .including disorders in feeding, sleep patterns, balance, and 

V perception. In addition, elevated expression of this gene product in regions of the 

< brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Funhermore, the protein may. also be used to daermine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to ' 
identify agents that moduJate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
30 ' K utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
' Many polynucleotide sequences,, such as EST sequences, are publicly 
50 ; , available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 102 and may have been publicly available prior to conception, 
of the present invention. Preferably/such related polynucleotides are specifically 
10 excluded from the scope of the present invention. To list every related sequence is 

cumbersome. Accordingly, preferably excluded from the present invention arc one or 
5 more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between i to 1827 of SEQ ID NO: 102, b is an 
15 integer of 1 5 to 1 84 1 , where both a and b correspond to the positions of nucleotide \ 

residues shown in SEQ ID NO: 102, and where b is greater than or equal to a + 14. 
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The translation product of this gene shares sequence homology with a 
probable membrane protein YGL054c -yeasi (Saccharomyccs cerevisiae). Moreover, 
The translation product of this gene also have homology to the human and 
25 " - mouse cornichon protein which is known to be necessary for both anterior-posterior 

15 and dorsal-ventral pattern formation in conjunction with the EGF receptor signaling 
process (See Genbank Accession Nos. gblAAC98388.1l (AF104398), and spIP52159; 
all references and information available through these accessions are hereby 
incorporated herein by reference: for example, Cell 81 (6). 967-978 (1995)). 

♦ The polypeptide of this gene has been determined to have two transmembrane 
domains at about amino'acid position 57 - 73, and 121 - 137 of the amino acid ' 
35 sequence referenced in Table 1 for this gene. Moreover, a cytoplasmic tail 

encompassing amino acids 1 - 14 of this protein has also been determined. Based 
upon these characteristics, it is believed that the protein product of this gene shares 
structural features to type Ola membrane proteins. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: YGCEKTTEGGRRRRRRMEAVVFVFSLLDCCAL 
IH^VYFirn^DLECDYINARSCCSKLNKWyiPELIGHT^ 
IHXM^PVATWNIYRYIMVPSGNMG 
LLC FFMYLYSMILALIND (SEQ ID NO:362). Polynucleotides encoding these 
50 polypeptides are also provided. ' 
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The gene encoding the disclosed cDNA is believed to reside on chromosome 
i. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis . for chromosome 1.,/ \ 

This gene is expressed primarily in activated T-cells and to a lesser extent in 
endometrial tumor, T cell helper II cells, microvascular endothelial cells, Raji cells 
treated with cyclohexamide and umbilical vein endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differentia] identification of the tissue(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune, hematopoietic, and vascular diseases and/or. disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely delected in certain tissues or cell types (e.g., immune, hematopoietic, 
vascular, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, amniotic fluid, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a- disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. ■ . 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 2 1 6 as residues: Ser-39 to Asn-45, Asn- 1 03 to Ser- 
109. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in activated T r cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immune disorders involving activated T-cells. Representative uses are described in 
the "Immune Activity" and "infectious disease" sections below, in Example 1 1, 13, 
14, 16, 18, 39, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival: differentiation; and/or - 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
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other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). 

Since the gene is expressed in ceils of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for ' 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou's Disease, infiammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
' such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demye I ination. systemic 
lupus erythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells^ or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Moreover, the protein 
is useful in the detection, trcatmcat, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery \ 
disease, arteriosclerosis, and/or atherosclerosis. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases.. Some of these sequences arc 
related to SEQ ID NO: 103 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b/where a is any, integer between 1 to 671 of SEQ ID NO: 103. b is an 
integer of 15 to 685 , where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 103. and where b is greater than or equal to a + 14. 

FEATURES OF. PROTEIN ENCODED BY GENE NO: 94 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
, the present invention/Specifically, polypeptides of the invention comprise the 
following amino acid sequence: ARAPAPSLPPLPSPAPALAPAHSLLGLLLGRMS 
GSSLPSALALSLLLVSGSLLPGPGAAQNVRVQSGQDQ (SEQ ID NO: 363). 
Polynucleotides encoding these nolypeptides are also provided. 

This gene is expressed primarily in dendritic ceils and to a lesser extent in 
healing abdomen wound, and pancreas islet cell tumor cells. 

Therefore, polynucleotides and polypeptides of the invention are 1 useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly 
wound healing disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type (s). For a' number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene ar 
significantly higher or lower levels is routinely detected in certain tissues or cell types " 
(e.g.. immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or ; 
cell sample taken from ah individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 2 17 as residues: Gln-34 to Lys-40. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in dendritic cells and early healing wound indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treating 
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wounds to enhance the healing process. Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11, 13, 14, 
16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. Tliis gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer <e.g ; by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis,; acne, neutropenia, neutrophilia, psoriasis, hypersensitivities 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic • 
lupus crythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue ■;, 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as/ 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly * 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 04 and mny have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
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cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to U 54 of SEQ ID NO: 104, b is an 
integer of 1 5 to 1 1 68, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 104, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of aortic smooth muscle 
cells to calcium. Thus it, is likely that the product of this gene is involved in a signal 
transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of both smooth muscle cells, and in other cell-lines or tissue 
cell types. Thus, polynucleotides and polypeptides have uses which include, but are 
not limited to, activating smooth muscle cells. Binding of a ligand to a receptor is 
known to alter intracellular levels of small molecules, such as calcium^ potassium and 
sodium, as well as alter pH and membrane potential., Alterations in small molecule 
concentration can be' measured to identify supernatanis which bind to receptors of a 
particular cell. • . * 

• This gene is expressed primarily in pancreatic carcinoma, gall bladder and 
primary dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic and immune diseases and/or disorders, particularly cancers, 
such as pancreatic carcinoma and gall bladder tumor. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissuc(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression 
of this gene at significantly higher or lower levels is routinely detected in certain 
tissues or cell types (e.g. i metabolic, immune, hematpoietic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an indi vidual having such 
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a disorder, relative to. the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. " 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 218 as residues: Lys-34 to Ilc-4I . Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in pancreatic carcinoma and gall bladder indicates that 
polynucleotides arid polypeptides corresponding to this gene are useful for diagnosing 
and treating cancer, such as pancreatic carcinoma and gaJl bladder tumors^ 
Representative uses arc described here and elsewhere herein; Alternatively, the , v 
detected calcium flux biological activity indicates the protein is usefuj in the 
. detection, treatment, and/or prevention of a variety of vascular disorders and 
conditions, which include; but are not limited to miscrovascular disease, vascular leak 
syndrome; aneurysm, stroke, embolism, thrombosis, coronary artery disease, 
arteriosclerosis,, and/or atherosclerosis. Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to . 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
protein may show utilityas a tumor marker and/or immunotherapy targets for the 
above listed tissues. / ^ ' - \ ■"' ■ > ? 

Many polynucleotide sequences, such as EST sequences, are publicly 
available arid accessible through sequence databases. Some of these sequences are 
related to SEQ ID-NO: 105 and may have been publicly available prior to conception „, 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome^ Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 161 of SEQ ID NO: 1 05, b is an 
integer of 15 to 1 175, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 105/ and where b is greater than or equal to a + 14. ' 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 96 

The polypeptide of this gene has beendetennined to have a transmembrane 
domain at about amino acid' position 10 - 26 of the amino acid sequence referenced in 
Table I for this gene. Moreover, a cytoplasmic tail encompassing amino acids 27 to 
48 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type lb 
membrane proteins. 

■ This gene is expressed primarily in osteosarcoma, wilm's tumor, ovarian 
cancer and in T-ceUs. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditioas which include, but are 
not limited to, inflammatory diseases and cancers, such as osteosarcoma, wilm's 
tumor and ovarian cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., skeletal, renal, reproductive, immune, and cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum .plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e.. the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 219 as residues: Ser-30 to Pro-35. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of inflammatory 
conditions and cancer, such as osteosarcoma, wilm's tumor and ovarian cancer. 
Moreover, the expression within cellular sources marked by proliferating ceils 
indicates this. protein may play a role in the regulation of cellular division, and may 
show utility in the diagnosis, treatment, and/or prevention of developmental diseases 
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and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "Hyperproliferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving ceil differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation arid differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification/Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions., Thus this protein may modulate 1 apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
rharkers, to isolate cognate ligarids or receptors, to identify agents that modulate their ' 
interactions, in addition to its use as a nutritional supplement: Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available arid accessible through sequence databases . Some of these sequences are 
related to SEQ ID NO: i 06 and may have been publicly available, prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 007 of SEQ ID NO: 106, b is an 
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integer of 15 to 1021, where both' a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 106. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 97 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
1 the present invention. Specifically, polypeptides of the invention comprise the 
foUowing amino acid sequence: GTSKDCVLYAFLDPGMAVPLFLYIFTLLPLLPF 
LLSLCFSPLTVKRSSSSESKSSL (SEQ ID NO: 364). Polynucleotides encoding 
these polypeptides are also provided. 

This gene is expressed primarily in ovarian cancer. 

Thereforc,.polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of trie tissuc(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, ovarian cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes jfor differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.gi, reproductive, ovarian, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial amniotic fluid, fluid or spinal fluid) or another tissue 
or cell sample taken frpnran individual having such a disorder, relative to the 
standard gene expression level; i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise Immunogenic 
epitopes shown in SEQ ID NO: 220 as residues: Thr-28 to Ser-40. Polynucleotides 
encoding said polypeptides arc also provided. 4 

The tissue distribution in ovarian tissues indicates that polynucleotides and 
polypeptides corresponding to this gene are' useful for treating and diagnosing cancer, 
e.g., ovarian cancer. Moreover, the expression within cellular sources marked by 
proliferating cells indicates this protein may play a role in the regulation of cellular 
division, and may show utility in the diagnosis, treatment, and/or prevention of 
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developmental diseases and disorders, including cancer, and oiher proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apbptosis in pattern r 
formation. ' ■■ " , 

, Dysregulation of apoptosis can result in inappropriate suppression of ceil 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
.also act as a morphpgen to control cell arid tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting; and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and lis useful in the detection, treatment, and/or prevention of ■ 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or . 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 07 and may have been publicly available prior to conception 
of the present invention. Preferably,* such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 



WO 00/06698 ■* PCTYUS99/I7130 

. • 241 • 

formula of a-b, where a is any integer between 1 to 816 of SEQ ID NO: 107, b is an 
integer of 1 5 to 830, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 107. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 98 

This gene is expressed primarily in macrophages and breast cancer tissue and 
to a lesser extent in osteoblasts and smooth muscle: 

Therefore, polynucleotides and polypeptides of the invention arc useful as * 
reagents for differential identification of the 'tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but. are 
not limited to. immune system dysfunction; inflammation; breast cancer; cancer; ' 
osteoporosis; osteopetrosis; peristaltic disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune and skeletal 
systems, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (eg., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another ' 
tissue or cell sample taken from an individual having such a disorder, relative to the * 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. . .' 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 221 as residues: Glu-16 to Ala-40. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and/or treatment of a variety of 
disorders. Expression in macrophages and other hematopoietic cell types indicates 
that this gene product is involved in the regulation of hematopoietic cell survival, 
proliferation, differentiation, or activation. It is involved in the control of such 
processes as immune surveillance, antigen presentation, T cell activation, cytokine 
release, and inflammation. Expression in breast cancer tissue may possibly correlate 
with the diagnosis and differentiation of cancerous tissue from normal breast tissue. 
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Expression in osteoblasts and osteoclasts may implicate this gene product in the 
process of bone turnover, and target it as a likely candidate for.thc treatment of 
osteoporosis and/or osteopetrosis. Finally, expressio in smooth muscle may indicate 
an involvement in the normal function of numerous internal organs and in the 
function of the digestive system. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases; Some of these sequences are 
related to SEQ ID NO: 108 and may t have been publicly available prior to conception 
of the present invention. Preferably, such relatedpolynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is . 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1287 of SEQ ID NO: 108, b is an 
integer of 1 5 to 1301, where both a and b correspond to* the positions of nucleotide 
residues shown in SEQ ID NO: 108, and where b is greater than or equal to a + 14. 
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Table 1 summarizes the information corresponding to each "Gene No." described 
above. The nucleotide sequence identified as 'TsT SEQ ID NO:X" was assembled 
from partially homologous ("overlapping") sequences obtained from the "cDNA 
clone ID" identified in Table 1 and, in some cases, from additional related DNA 
ciones. '"Tie overlapping sequences were assembled into a single contiguous sequence 
of high redundancy (usually three to five overlapping sequences at each nucleotide 
position), resulting in a final sequence identified as SEQ ID NO:5C 

The cDNA Clone TO was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits 
contain multiple different clones corresponding to the same gene. "Vector" refers to 
the type of vector contained in the cDNA Clone ID. 

'Total NT Seq." refers to the total number of nucleotides in the contig 
identified by "Gene No." The deposited clone may contain all or most of these 
sequences, reflected by the nucleotide position indicated as "5* NT of Clone Seq." 
and the "3 ' NT of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID 
NO:X of the putative start codon (methionine) is identified as "5' NT of Start Codon." 
Similarly , the nucleotide position of SEQ ID NO:X of the predicted signal sequence 
is identified as "5' NT of First AA of Signal Pep." 

The translated amino acid sequence, beginning wiih the methionine, is 
identified as "A A SEQ ID NO:Y " although other reading frames can also be easily ■ 
.translated using known molecular biology techniques. The polypeptides produced by 
these alternative open reading frames are specifically contemplated by the present 
invention. 

The first and last amino acid position of SEQ ID NO:Y of the predicted signal 
peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep.** The 
predicted first amino acid position of SEQ ID NO: Y of the secreted portion is 
identified as "Predicted First AA of Secreted Portion." Finally, the amino acid 
position of SEQ ID NO: Y of the last arnino acid in the open reading frame is 
identified as "Last AA of ORF" 

SEQ ID NO:X and the translated SEQ ID NO: Y are sufficiently accurate and 
otherwise suitable for a variety of uses well known in the art and described further 
below. For instance, SEQ ED NO:X is useful for designing nucleic acid hybridization 
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probes that will detect nucleic acid sequences contained inSEQ ID NO:X or the 
cDNA contained in the deposited clone. These probes will also hybridize to nucleic 
acid molecules in biological samples, thereby enabling a variety of forensic and 
diagnostic methods of the invention: Similarly, polypeptides identified from SEQ ID - 
NO: Y may be used to generate antibodies which bind specifically to the secreted 
proteins encoded by the cDNA clones identified in Table 1. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or , 
deletions of nucleotides in the generated DNA sequence/ The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino 
acid sequence. In these cases, the predicted ami™ acid sequence diverges from the 
actual amino acid sequence, even though the generated DNA sequence may be greater 
than 99.9% identical to the actual DNA sequence (for example, one base insertion or 
deletion in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 
sequence or the amino acid sequence, the present invention provides not only the 
generated nucleotide sequence identified as SEQ ID NO:X and (he predicted 
translated amino acid sequence identifiedas SEQ ID NO:Y, but also a sample of *' 
plasmid DNA containing a human cDNA of the invention deposited with the ATCC, 
as set forth in Table 1 . The nucleotide sequence of each deposited clone can readily 
be determined by sequencing the deposited clone in accordance with known methods^ , 
The predicted amino acid sequence can then be verified from such deposits.* 7 
Moreover, the amino acid sequence of the protein encoded by a particular clone can 
also be directly determined by peptide sequencing or by expressing the protein in a 
suitable host cell containing the deposited human cDNA, collecting the protein, and * 
determining its sequence. ,■ 

The present invention also relates to the genes corresponding to SEQ ID 
NO:X, SEQ ID NO:Y, or the deposited clone. The corresponding gene can be 
isolated in accordance with known methods using the sequence information disclosed H 
herein. Such methods include preparing pmhes or primers from the disclosed 
sequence and identifying or amplifying the corresponding gene from appropriate 
sources of genomic material. 
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Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers from * 
the sequences provided herein and screening a suitable nucleic acid source for the 
desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. 
Such polypeptides include isolated naturally occurring polypeptides, recombinantly 
produced polypeptides, synthetically produced polypeptides, or polypeptides 
produced by a combination of these methods. Means for preparing such polypeptides 
are well understood in the art. 

The polypeptides may be in .the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). ' 
It is often advantageous to include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production.* 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. A recombinantly produced 
version of a polypeptide, including the secreted polypeptide, can be substantially 
purified by the one-step method described in Smith and Johnson, Gene 67:3 1-40 
(1988). Polypeptides of the invention also can be purified from natural or 
recombinant sources using antibodies of the invention raised against ihe secreted , 
protein in methods which are well known in the art. 

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of 
McGeoch, Virus Res. 3:271-286 (1985), uses the information from a short N-terminal 
charged region and a subsequent uncharged region of the complete (uncleaved) 
protein, the method of von Heinje, Nucleic Acids Res. 14:4685-4690 (1986) uses the 
information from the residues surrounding the cleavage site, typically residues -13 to 
+2, where +1 indicates the amino terminus of the secreted protein. The accuracy of </ 
predicting the cleavage points of known mammalian secretory proteins for each of 
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these methods is in the range of 75-80%. (von Hcinje, supra.) However, the two * 
methods do noi always produce the same predicted cleavage point(s) for a given 
protein. ' ■* 

In the present case, the deduced amino acid sequence of the secreted 
polypeptide was analyzed by a computer program called SignaJP (Henri k Nielsen et 
al;, Protein Engineering 10: 1 -6 ( 1997)), which predicts the cellular location of a 
* protein based on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGeoch and von Heinje are incorporated. The analysis - 
of the amino acid sequences of the secreted proteins described herein by this program 
provided the results shown in Table 1. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 
vary from organism to organism and cannot be predicted with absolute certainty. 
Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e., " ■ • ■ 
+ or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that 
in some cases, cleavage of the signal sequence from a secreted protein is not entirely 
uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present 
invention.. - 

Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For- example, the 
naturally occurring signal sequence may be further upstream from the predicted signal ! 
sequence. However, it is likely that the predicted signal sequence will be capable of 
directing the secreted protein to the ER. These polypeptides, and the polynucleotides 
encoding such polypeptides, are contemplated by the present invention^ 

Polvnuclwillde and Poly peptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential 
properties thereof. Generally; variants are overall closely similar, and, in many b 
regions, identical to the polynucleotide or polypeptide of the presenttinvention. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
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"identical " to a reference nucleotide sequence of the present invention, h is intended 
s that, the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point , 
mutations per each 100 nucleotides of the reference nucleotide sequence encoding the 
polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence 
at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides 
in the reference sequence may be deleted or substituted with another nucleotide* or a 
number of nucleotides up to 5% of the total nucleotides in the reference sequence may 
be inserted into the reference sequence. The query sequence may be an entire, 
sequence shown inTable 1 , the ORF (open reading frame), or any fragement specified 
as described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%. 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 
computer programs. A preferred method for determing die best overall match 
. between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using 
th? FASTC3 computer program based on the algorithm of Brutiag et al. (Comp. App. 
Biosci. (1990) 6:237-245). In a sequence alignment the query and subject sequences 
are both DNA sequences. An RNA sequence can he compared by converting U*s to 
Vs. The result of said global sequence alignment is in percent identity. Preferred 
parameters used in a FASTDB alignment of DNA sequences to calculate percent 
identiy arc: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining Penalty=30, * 
Randomization Group Length=0, Cutoff Sc6fe=l. Gap f enaJty=5,* Gap Size Penalty 
0.05, Window Size=500 or the lenghtof the subject nucleotide sequence, whichever is 
shorter. ■ t ' . 

If the subject sequence is shorter than the query sequence because of 5 * or 3' 
deletions, not because of" internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5 ? and 3\ 
truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or V ends, relative to the the query sequence! the 
percent identity is corrected by calculating the number of bases of the query sequence 



WO 00/06698 



267 



PCT/US99/17I30 



that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent 
- of the total bases of the query sequence. Whether a nucleotide is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only bases outside the 
5 ' and 3* bases of the subject sequence, as displayed by the FASTDB alignment, 
which arc not matched/aligned with the query sequence, are calculated for the 
purposes of manually adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determine percent identity. The deletions occur at the 5' end of the 
subject sequence and therefore, the FASTDB alignment does not show a 
. matched/alignement of the first 10 bases at 5- end. The 10 unpaired bases represent' 
10% of the sequence (number of bases at the 5' and 3' ends not matched/total number 
of bases in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 bases were perfectly 
matched the final percent identity would be 90%. In another example, a 90 base 
subject sequence is compared with a 100 base query.sequence. This time the. 
deletions are internal deletions so that there are no bases on the 5' or 3* of the subject 
sequence which are not matched/aligned with the query, in this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only.bases 5' 
and 3* of the subject sequence which are not matched/aligned with the query sequnce 
are manually corrected for. No other manual corrections are to made for the purposes 
of the present invention. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
identical" to a query amino acid sequence of the present invention, it isuntended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 
alterations per each 100 amino acids of the query amino acid sequence. In other 
words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted,- (indels) or substituted with another amino acid. 
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These alterations of the reference sequence may occur at the amino or carboxy 
terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in 
Table 1 or to the amino acid sequence encoded by deposited DNA clone can be 
determined conventionally using known computer programs. A preferred method for 
determing the best overall match between a query sequence (a sequence of the present 
invention) and a subject sequence, also referred to as a global sequence alignment, 
can be determined using the FASTDB computer program based on the algorithm of 
Brutiag et al. (Comp. App. Biosci. (1990) 6:237*245); In a sequence alignment the 
query and subject sequences are either both nucleotide sequences or both amino acid 
sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0* 
k-tuplc=2. Mismatch Penalty= 1 , Joining Penalty=20, Randomization Group 
Length=6, Cutoff Scores L, Window Size=sequence length, Gap Penalty=5, Gap Size 
Pena!vy=".C5, Window Size=500 or the length of the subject. amino acid sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terrninai deletions, not because, of internal deletions, a manual correction must be 
made to the results. This is becuase the FASTDB program does not account for isl- 
and C-terminal truncations of the subject sequence when calculating global percent 
identity. For subject sequences truncated at the N- and C-termini. relative to the the 
query sequence, the percent identity is corrected by calculating the number of residues 
of the query sequence that arc N- and C-terminal of the subject sequence, which are 
not matched/aligned with a corresponding subject residue, as a percent of the total 
bases of the query sequence: Whether a residue is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from 
the percent identity; calculated by i he above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score 
is what is used for the purposes of the present invention. Only residues to the N- and 
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: C-termini of the subject sequence, which are not matched/aligned with the query 
* sequence, are considered for the purposes of manually adjusting the percent identity 
score. That is, only query residue positions outside the farthest N- and C-terminal 
residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terrninus of the subject sequence and therefore! the FASTDB alignment does not 
showa matching/alignment of the first 10 residues at the N-terminus. The 10 
unpaired residues represent 10% of the sequence (number of residues at the N- and C- 
termitii not matched/total number- of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the 
remaining 90 residues were perfectly matched the final percent identity would be 
90%. In another example, a 90 residue subject sequence is compared with a 100 ■ 
residue query sequence. This time the deletions are internal deletions so there are no 
residues at the N- or C-termini of the 'subject sequence. which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not 
manually corrected. Once again, only residue positions outside the N- and C-terminal 
ends of the subject sequence, as displayed in the FASTDB alignment, which are not 
matched/aligned with the query sequnce are manually corrected for. No other manual 
corrections are to made for the purposes of the present invention. 

The variants may contain alterations in the coding regions, non-coding 
regions, or both. Especially preferred are polynucleotide variants containing 
alterations which produce silent substitutions, additions, or deletions, but do not alter 
the properties or activities of the encoded polypeptide. Nucleotide variants produced 
by silent substitutions due to the degeneracy of the genetic code are preferred. 
Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or 
added in any combination are also preferred. Polynucleotide variants car. be produced 
for a variety of reasons, e.g., to optimize codon expression for a particular host 
(change codons in the human mRNA to those preferred by a hacterial host such as E. 
coli). . • . ' ; 

Naturally occurring variants are called "allelic variants." and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 



WO 00/06698 \* PCT/US99/17130 

, '.; , ' ' 270 ■ 

organism. (Genes It Lewin, B.. ed, John Wiley & Sons, New York (1985).) These 
allelic variants can vary* at either the polynucleotide and/or "poly peptide level. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

Using known' methods of protein engineering and recombinant DNA 
technology, variants may be generated to improve or alter the characteristics of the , 
polypeptides of the present invention. For instance, one or more amino acids can be ' * 
deleted from the N-terminus or C-terrnihus of the secreted protein without substantial 
loss of biological function. The authors of Ron ct al., J. Biol. Chem. 268: 2984r2988 
(1993), reported variant KGF proteins having heparin binding activity even after 
deleting 3, 8, or 27 amino- terminal amino acid residues. Similarly/ Interferon gamma 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from 
the carboxy terminus of this protein. (Dobeli et al.. J. Biotechnology 7: 199-2 16 1 . 
0988).) ' .,/ " 

Moreover, ample evidence demonstrates that variants often retain a biological 
activity similar to that of the naturally occurring protein. For example, Gayle and 
coworkers (J. Biol. Chem 268:22105-221 11 (1993)) conducted extensive mutational 
analysis of human cytokine IL-la. They used random mutagenesis to generate over 
3.500 individual IL- 1 a. mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every - 
possible amino acid position. The investigators found that "[m]ost of the molecule 
could be altered with little effect on either [binding or biological activity]." (See, '.' - 
Abstract.) In fact, only 23 unique amino acid sequences, out of more than 3,500 
nucleotide sequences examined, produced a protein that significantly differed in 
activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N- terminus or . 
C- terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the. ability. of 
a deletion variant to induce and/or to bind antibodies which recognize the secreted 
form will likely be retained when less than -the majority of the residues of the secreted 
form are removed from the N-terminus or C-terminus. Whether a particular 
polypeptide lacking N- or C- terminal residues of a protein retains such immunogenic 
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activities can readily be determined by routine methods described herein and 
otherwise known in the art - ! 

', Thus, the invention further includes polypeptide variants which show 
substantial biological activity. Such variants include deletions, insertions, 
inversions, repeats, arid substitutions selected according to general rules known in the 
art so as have little effect on activity For example, guidance concerning, how to make 
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.; 
Science 247: 1306-1310 (1990), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. ' 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution; By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 1 
positions are not critical for protein function. Thus, positions tolerating amino acid 
substitution could be modified while still maintaining biological activity of the 
protein. V * • • • . 

The second strategy uses genetic engineering to introduce amino acid changes 
at specific positions of a cloned gene to identify regions critical for protein function. 
For example, site directed mutagenesis or alaninc-scanning mutagenesis (introduction 
of single alanine mutations at every residue in the, molecule) can be used^ 
(Cunningham and Wells. Science 244:1081-1*085 (1989).) The resulting mutant , ' • 
molecules can then be tested for biological activity. f , 

As the authors state, these two strategies have-rcvcaled that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which 
amino acid changes are likely to be permissive at certain amino acid positions in the 
protein. For example, most buried (within the tertiary structure of the protein) amino * 
acid residues require nonpolar side chains, whereas few features of surface side chains' 
are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val. Leu and 
lie; replacement of the hydroxy! residues Ser and Thr; replacement of the acidic 
residues Asp and Glu; replacement of the amide residues Asn and Gin, replacement of 
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the basic residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, 
and Trp, and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved amino acid residues, 
where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substitucnt group, or (iii) fusion of the mature polypeptide with another compound, 
such as a compound to increase the stability and/or solubility of the polypeptide (for 
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino . 
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of 
charged amino acids with other charged or neutral amino acids may produce proteins 
with improved characteristics, such as less aggregation. Aggregation of 
pharmaceutical formulations both reduces activity and increases clearance due to the 
aggregate's immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:33 1-340 
(1967); Robbins et al.. Diabetes 36: 838-845 (1987); Cleland et al., Crii. Rev. 
Thcrapeuuc Drug Carrier Systems 10:307-377 (1993).) 

A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of the present invention having an amino acid 
sequence which contains at least one amino acid substitution, but not more than 50 
amino acid substitutions, even more preferably, not more than 40 amino acid 
substitutions, still more preferably, not more than 30amino acid substitutions, and 
still even more preferably, not more than 20 amino acid substitutions. Of course, in 
order of ever-increasing preference, it is highly preferable for a polypeptide to have 
an amino acid sequence which comprises the amino acid sequence of the present 
invention, which contains at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3. 2 or 1 " 
amino acid substitutions. In specific embodiments, the number of additions, 
substitutions, and/or deletions in the amino acid sequence of the present invention or 
fragments thereof (e.g., the mature form and/or other fragments described herein), is 
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1-5, 5-10, 5-25, 5-50, 10-50 or 50-150, conservative amino acid substitutions are 
preferable. 

Polynucleotide and Pol ypeptide Frngm^nK 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 
nt, and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt 
in length," for example, is intended to include 20 or more contiguous bases from the 
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and 
primers as discussed herein.. Of course, larger fragments (e.g., 50, .150, 500, 600, 
2000 nucleotides) are preferred. . " 

Moreover, representative examples of polynucleotide fragments of the 
invention, include, for example, fragments having a sequence from about nucleotide 
number 1-50,51-100, 101-1 50*. 151-200, 20 1-250 T! 25 1-300, 301-350,351-400,401- 
450, 45 1-500. 501-550, 551-600, 65 1 -700, 70 1-750, 75 1-800, 800-850, 851-900, 901- 
950,951-1000, 1001-1050, 1051-1 100, 1 101-1150, 1151-1200, 1201-1250, 1251- 
1300,1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601- 
1650, 1651-1700,1701-1750,1751-1800. 1801-1850, 1851-1900, 1901-1950, 1951- 
2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the deposited 
clone. In this context ''about" includes the particularly recited ranges, larger or 
smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 
Preferably, these fragments encode a polypeptide which has biological activity. More 
preferably, these polynucleotides can be used as probes or primers as discussed 
herein. ' ' 

In the present invention, a ' polypeptide fragment" refers to a short amino acid 
sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the 
deposited clone. Protein fragments may be "free-standing,'' or comprised within a 
larger polypeptide of which the fragment forms a pan or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 
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invention, include, for example, fragments from about amino acid number 1-20, 21- 
, 40, 41-60, 61-80, 8 1-100, 102-120, 121-140. 141-160, or 161 to the end ofthccoding 
region. Moreover; polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100. 1 10, 120, 130, 140, or 1 50 amino acids in length. In this context "about'' ■< 
includes the particularly recited ranges, larger or smaller by several (5, 4; 3, 2 f or 1 ) 
armno acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted protein as well as the 
mature form. Further preferred polypeptide fragments include the secreted protein or 
the mature form having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from. 1- 
60. can be deleted from the amino terminus of either the secreted polypeptide or the 
mature form. Similarly, any number of amino acids, ranging from 1-30, can be 
deleted from the carboxy terminus of the secreted protein ormature form. 
Furthermore, any combination of the above amino and carboxy terminus deletions are 
preferred. Similarly, polynucleotide fragments encoding these polypeptide fragments 
are also preferred. 

Also preferred are polypeptide and polynucleotide fragments characterized by 
structural or functional domains, such as fragments that comprise alpha-helix and 
alpha-helix forming regions, beta sheet and bda^ sheet- forming regions, turn and turn- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 
surface-forming regions, substrate binding region, and high antigenic index regions. 
Polypeptide fragments of SEQ ID NO: Y falling within conserved domains are 
specifically contemplated by the present invention. Moreover, polynucleotide 
fragments encoding these domains are also contemplated! ». . .. 

Other preferred fragments are' biologically active fragments. Biologically 
active fragments are those exhibiting activity similar, but not necessarily identical, to 
an activity of the polypeptide of the present invention! The biological activity of the' 
fragments may include an improved desired activity, or a decreased undesirable 
activity. , * . 
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* ' In the present invention," epitopes'* refer to polypeptide fragments having 

f antigenic or immunogenic activity in an animal, especially in a human. A preferred 
10 * * embodiment of the present invention relates to a polypeptide fragment comprising an 

epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
5 molecule to which an antibody can bind is defined as an "antigenic epitope." In 
contrast,' an "immunogenic epitope" is defined as a part of a protein that elicits an 
15 antibody response. '(See. for instance; Geysen et a]., Proc. Natl. Acad. Sci. USA 

81:3998- 4002 (1983).) 

Fragments which function as epitopes may be produced by any conventional 
2Q 10 means. (See. e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) 

,. further described in U.S. Patent No. 4,63 1, 2 II.) t , ;■ 

In the present invention, antigenic epitopes preferably contain a sequence of at 
t least seven, more preferably at least nine, and most preferably between about 15 to 
25 ■< v about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 

15 monocipnal antibodies, that specifically bind the epitope. (See, for instance, Wilson 
et al., Ceir 37:767-778 ( 1984); Sutciiffc, J. G. et al., Science 219:660-666 (1983).) * 

Similarly, immunogenic epitopes can be used to induce antibodies according 
to methods well known in the art. (See; for instance, Sutcliffe et al., supra; Wilson et 
al.. supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F J. el 
al., J. Gen. Vir6l. 66:2347-2354 (1985),) A preferred immunogenic epitope includes 
35 ; l ~ he secreted protein. The immunogenic epitopes may be presented together with a * 

carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, 
if it is long enough (at least about 25 amino acids), without a carrier. However, 
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to 
40 25 sufficient to raise antibodies capable of binding to, at the very least, linear epitopes 

in a denatured polypeptide (e.g., in Western blotung.) v 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
45 meant to include intact molecules as well as antibody fragments (such as, for 

example. Fab and F(ab')2 fragments) which are capable of specifically binding to 
30 protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear 

•> more rapidly from the circulation, and may have less npn-specific tissue binding than ' 
50 . ah intact antibody, f Wahl et al., J. Nucl.Mcd.-24:3 16-325 (1983).) Thus, these 
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fragments are preferred, as well as the products of a FAB or other immunoglobulin. . 
expression library. Moreover, antibodies of the present-invention include chimeric, 
single chain, and humanized antibodies. 

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion: *. 
proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
polypeptide of the present invention can be used to indirectly detect the second 
protein by binding to the polypeptide. Moreover, because secreted proteins target 
cellular locations based on trafficking signals, the polypeptides of the present 
invention can be used as targeting molecules once fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present 
invention include not only heterologous signal sequences, but alsoother heterologous 
functional regions. The fusion does not necessarily need to be direct, but may occur 
through linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics 
cf the polypeptide of the present invention. For instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-tcrminus of the 
polypeptide to improve stability and persistence during purification from the host cell 
or subsequent handling and storage. Also, peptide moieties may be added to the , 
polypeptide to facilitate purification. Such regions may be removed prior to final 
preparation of the polypeptide. The addition of peptide moieties to facilitate handling 
of polypeptides are familiar and routine techniques in the art. 

Moreover ; polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins , 
facilitate purification and show an increased half-life in vivo. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains * 
of mammalian immunoglobulins. .(EP A 394,827 ; Traunecker et al., Nature 331:84- ' 
.86 (1988),) Fusion proteins having disulfide-Jinked dimeric structures (due to the 
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IgG) can also be more efficient in binding and neutralizing other molecules, than the 
monomeric secreted protein or protein fragment alone. (Fountouiakis ct al„ J 
Biochem,270:3958r3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions' of constant region of immunoglobulin molecules . 
together with another human protein or partthereof In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in- for • 
example, improved pharmacokinetic properties: (EP-A 0232 262.) Alternatively, " 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if 
the fusion protein is used as an antigen for immunizations. In drug discovery, for. 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See/ 
D. Bennett et ah, J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., j. Biol. 
Chem. 270:9459-9471 (1995).) , t v 1 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. 
In preferred embodiments, the marker amino acid sequence is a hexa-hisudihe 
peptide, such as the tag provided in u pQE vector (Q1AGEN. Inc., 9259 Eton Avenue, 
Chatsworth, CA, 9131 1 ), among others, many of which are commercially available. 
As described in Gentz et ai., Proe. Natl. Acadi Sci. USA 86:821-824 (1989), for 
instance, hexa-histidme provides for convenient purification of the fusion protein/ ; 
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al.. Cell 37:767 
(1984).) : 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

Vectors. Host Cells, anrj P rotein PrnrinrHnn 

The present invention also relates to vectors containing the polynucleotide of 
the present invention, : host cells, and the production of polypeptides by recombinant 
techniques. The vector may be. for example, a phage, plasmid, viral, or retroviral 
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vector. Retroviral vectors may be replication competent or replication defective. In 
the latter case, viral propagation generally will occur only in complementing host 
cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a piasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid.' If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 
and then transduced into host cells. 

The polynucleotide insert should be operauvely linked to an appropriate 
■ promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to 
name a few. Other suitable promoters will be known to the skilled artisan. The - 
expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolatc reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillm 
resistance genes for culturing in E. coli and other bacteria. Representative examples 
of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture 
mediums and conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE7(), pQE60 and pQE- 
9, available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors. pNH8A, 
pNH16a, pNH18A, pNII46A, available from Stratagene Cloning Systems, Inc.; and ; 
ptrc99a t pKK223-3, pKK233-3, P DR540, pRIT5 available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
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and pSG avaUable from Stratugene: and pSVK3, pBPV, pMSG and pSYL available 
from Pharmacia. Other suitable vectors wffl be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporauon, transduction, infection, or other methods. Such methods 
arc described in many standard laboratory manuals, such as Davis et al., Basic 
Methods In Molecular Biology (1 986).. It is specifically contemplated that the 
polypeptides of the present invention may in fact be expressed by a host cell lacking a 
recombinant vector. 

A polypeptide of this invention can be recovered and purified from - , 

recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of the present invention; and preferably the secreted form, can 
also be recovered from: products purified from natural sources, including bodily 
fluids, tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from a 
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, 
insect, and mammalian cells. Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be glycosylated 
or may be non-glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some cases as a result of host- 
mediated processes. Thus, it is well known in the art that the N-terminal methionine ' 
encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-tcrminal 
methionine on most proteins also is efficiently removed in most prokaryotcs, for some 
proteins, this prokaryotic removal process is inefficient, depending on the nature of 
the amino acid to which the N-terminal methionine is covalently linked. 
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In addition to encompassing host cells containing the vector constructs 
discussed herein, the invention' also encompasses primary, secondary, and 
immortalized host cells of vertebrate origin, particularly mammalian origin, that have 
been engineered to delete, or replace endogenous genetic material (e.g., coding 
sequence), and/of to include genetic material (e.g., heterologous polynucleotide 
sequences) . that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, 
techniques known in the art may be used to operably associate heterologous control 
regions (e.g.. promoter and/or enhancer) and endogenous polynucleotide sequences 
via homologous recombination (see, e.g., U.S. Patent No. 5,641 ,670,, issued June .24, 
1997; International Publication No. WO 96/29411, published September 26, 1996; 
International Publication No. WO 94/12650, published August 4, 1994; Roller et al., 
Proc. 'Natl. 'Acad. Sci. USA i 86:8932-8935 (1989); and Zijlstra et al.. Nature 342:435- 
438 (1989), the disclosures of each of which are incorporated by reference in their 
entireties). , 

Uses Cfthc PoiVnurWirtP* 

Each of the' polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes : 
known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. , There exists an ongoing need to identify new chromosome markers, 
, since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are.presently available. Each polynucleotide of the present 
invention can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of 
somatic cell hybrids containing individual human chromosomes. Only those hybrids 
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containing the human gene corresponding to the SEQ ID NO:X will yield an 
amplified fragment 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides 
can be achieved with panels of specific chromosome fragments. Other gene mapping * 
strategies that can be used include in situ hybridization, prescreening with labeled 
flow- sorted chromosomes, and preselection by hybridization to construct 
chromosome specific -cDN A libraries. 

Precise chromosomal location of the polynucleotides can also be achieved - 
using fluorescence in situ hybridization (FISH) of a metaptiase chromosomal spread ■ 
This technique uses polynucleotides as short as 500 or 600 bases; however, 
polynucleotides 2,000-4,000 bp are preferred. For a review of this technique, see 
Verma et al., "Human Chromosomes: a Manual of Basic Techniques," Pergamon 
Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
maiidng nu'tiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping.. 

Once a polynucleotide has been mapped to a precise chromosomal location, 
the physical position of the polynucleotide can be used in linkage analysis. Linkage * 
analysis establishes coinheritance between a chromosomal location and presentation 
of a particular disease. (Disease mapping data are found, for, example, in V. 
McKusick, Mendelian Inneritance in Man (available on line through Johns Hopkins 
University Welch Medical Library) .) Assuming 1 megabase mapping resolution and 
one gene per 20 jjcb; a cDN A precisely localized to a chromosomal region associated 
with the disease could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 
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translocations, are examined in chromosome spreads or by HCR. If no structural 
alterations exist, the presence of point mutations are ascertained. Mutations observed 
in some or all affected individuals, but not in normal individuals, indicates that the ■•• 
mutation may cause the disease. However, complete sequencing of the polypeptide 
and the corresponding gene from several normal individuals is required to distinguish 
: the mutation from a polymorphism. If a new polymorphism is identified, this 
polymorphic polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the presenr invention. Any of these alterations (altered expression,* 
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. "• , ' 

In addition to the foregoing, a polynucleotide can he used to control gene . ' 
expression through triple helix formation or antisense DNA or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, * 
preferred polynucleotides are usually 20 to 40 bases in length and complementary to 
either the region of the gene involved in transcription (triple helix see Lee et al., 
Nud. Acids Res. 6:3073(1979); Cooneyetal.. Science 241:456(1988); and Dervan . 
et al.; Science 25 1 : 1360 (1991) ) or to the mRNA itself (antisense - Okano, J. 
Ncurochem. 56:560 (1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene ; 
Expression, CRC Press, Boca Raton, FL (1988).) Triple helix formau on optimally 
results in a shut-off of RNA transcription from DNA^ while antisense RNA 
hybridization blocks translation of an mRNA molecule into polypeptide. Both 
techniques are effective in model systems, and the information disclosed herein can 
he used to design antisense or triple helix polynucleotides in an effort to treat disease. 

Polynucleotides of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a hew trait in the host cell. - . 
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The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example: is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel; . 
In this technique, an 'individual's genomic DNA is digested with one or more , 
restriction enzymes, and probed on a Southern, blot to yield unique bands for 
identifying personnel. This method does not suffer from the current limitations of 
"Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The polynucleotides of the present invention can be used as additional DNA \ 
markers for RFLP. ', 

The polynucleotides of the present invention can also be used as an alternative 
to RFLP. by determining the actual base -by-base DNA sequence of selected portions 
of an individual's genome. These sequences can be used to prepare PCR primers for « 
amplifying and isolating such selected DNA, which can then be sequenced. Using 
this technique, individuals can be identified because each individual will have a 
unique set of PNA sequences: Once an unique ID database is established for an 
individual, positive identification of that individual, living or dead, can be made from 
extremely small tissue samples': 

Fcrcr. sic biology also benefits from using DNA-based identification 
techniques as disclosed herein. DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, 
etc., can be amplified using PCR. In one prior art technique, gene sequences 
amplified from polymorphic loci, such as DQa class II HLA gene, are used in forensic 
biology to identify individuals/ (Erlich, H., PCR Technology, Freeman and Co. < 
(1992).) Once these specific polymorphic loci arc amplified, they are digested with 
one or more restriction enzymes, yielding an Identifying set of bands on a Southern 
blot probed with DNA corresponding to the DQa class II HLA gene. Similarly, * 
polynucleotides of the present invention can be used as polymorphic markers for 
forensic purposes. 

There is also a heed for reagents capable of identifying the source of a \ 
particular tissue. Such need arises, for example, in fdrensics when presented with ' 
tissue of unknown origin. Appropriate reagents can comprise, for example, DNA 
probes or primers specific to particular tissue prepared from the sequences of the 
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present invention. Panels of such reagents can identify tissue by species and/or by 
organ type. In a similar fashion, these reagents can be used to screen tissue cultures 
for contamination. 

In the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making ' 
oligomers for attachment to a "gene chip" or other support, to raise anti-DNA 
antibodies using DNA immunization techniques, and as an antigen to elicit an 
immune response. 

Uses of th e Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody- based techniques. For example, protein expression * 
in tissues can be studied with classical immunohistoiogical methods. (Jalkanen, M., 
et ah, J; Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087- 
3096 (1987).) Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay 
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 
as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), indium (H2In), and 
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography. NMR or ESR. For X- . 
radiography, suitable labels include radioisotopes such as barium or cesium, which 
emit detectable radiation but arc not overtly harmful to the subject. Suitable markers 
for NMR and ESR include those with a detectable characteristic spin, such as 
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deuterium, which may be incorporated into the antibody by labeling of nutrients for 
the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311. 
1 12In, 99mTc), a radio-opaque substance, or a material detectable by nuclear 
magnetic resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperttoneally) into the mammal. It will be understood in the an that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
needed to produce diagnostic images. In the case of a radioisotope moiety, for a 
human subject, the quantity of radioactivity injected will normally range from about 5 ' 
to 20 millicurics of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. 
In vivo tumor imaging is described in S.W. Burchiel et al M "Irnmunopharmacokinetics 
of Radiolabeled Antibodies 'and Their Fragments." (Chapter 13 in Tumor Imaging: 
The Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., 
Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which 
involves (a) assaying the expression of a pblypepiide of the present invention in cells 
or body fluid of an individual; (b) comparing the level of gene expression with a 
standard gene expression level, whereby an increase or decrease in the assayed 
polypeptide gene expression level compared to the standard expression level is 
indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibiuhe activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 
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Similarly, antibodies directed to a polypeptide of the present invention can 
also be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 

• polypeptide. Similarly, administration of an antibody can activate the polypeptide, 
such as by. binding to a polypeptide bound lo a membrane (receptor). 

• At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from 
a recombinant cell, as a way of assessing transformation of the host cell. Moreover, : 
the polypeptides of the present invention can be used to test the following biological 
activities. 

Biological Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved. in the diseases associated with the biological activity. Thus, the 

• polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune . 
cells develop through a process called hematopoiesis, producing myeloid (platelets, 
red blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) 
cells from pluripotent stem cells. The etiology of these immune deficiencies or 
disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious: Moreover, a polynucleotide 
or polypeptide of the present invention can be used as a marker or detector of a 
particular immune system disease or disorder. r 
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A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A 
polypeptide or polynucleotide of the present invention could be used to increase ^ 
differentiation and proliferation of hematopoietic cells; including the pluripotent stem 
cells, in an effort to treat those disorders associated with a decrease in certain (or 
many) types hematopoietic cells. Examples of immunologic deficiency syndromes 
include, but are not limited'to: blood protein disorders (e.g. agammaglobulinemia, 
dysganunaglobulinemia), ataxia telangiectasia, common variable immunodeficiency, 
Digeorgc Syndrome, HIV infection, HTLV 7 BLV infection, leukocyte adhesion 
deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. ' 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate Hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 1 
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, ;a polynucleotide or, polypeptide of the present invention that 
can decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 1 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes^ or scarring. 

A. polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of 
the host tissue, therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T^cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the 
present invention include, but are not limited to: Addison's Disease, hemolytic 
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anemia, amiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 
encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, 
Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 
Pemphigus, Polyendocrinopathies. Purpura, Reiter's Disease, Stiff-Mari Syndrome, 
Autoimmune Thyroiditis, Systemic Lupus Erythematosus/Autoirnmune Pulmonary 
Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and 
. autoimmune inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by a polypeptide 
or polynucleotide of the present invention. Moreover, these molecules can be used to 
treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood group.- 
incompatibility. . .. ' 

A polynucleotide or polypeptide of the present invention raay also be used to 
treatand/or prevent organ rejection or graft- versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an' 
immune response. Similarly, an immune response is also involved in GVHD. but, in * 
this case, the foreign transplanted immune cells destroy the host tissues. The 
administration of a polypeptide or polynucleotide of the present invention that inhibits; 
an immune response, particularly the proliferation, differentiation, or chemoiaxis of 
T-cells, may be an effective therapy in preventing organ rejection or GVHD. 

( Similarly, a polypeptide or polynucleotide of the present invention may also, 
be used to modulate inflammation. For example, the polypeptide or polynucleotide 
may inhibit the proliferation and differentiation of cells involved™ an inflammatory 
response. These molecules can be used to treat inflammatory conditions, hoth chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
• shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia- ; 
reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chernokine induced lung injury, inflammatory bowel > 
disease, Crohn* disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1.) ' ' • ' ' ; 

Hvperproliferative Disorders 
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A polypeptide or polynucleotide can be used to treat or detect . 
hyperproliferative disorders, including neoplasms. A polypeptide or polynucleotide 
of the present invention may inhibit the proliferation of the disorder through direct or " 
indirect interactions. Alternatively, a polypeptide or polynucleotide of the present 
invention may; proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune, response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells } - hyperproliferative disorders can be treated. 
This immune response may be increased by either enhancing an existing immune 
response, or by initiating a new immune response. Alternatively, decreasing an * 
immune response may also be a method of treating hyperproliferative disorders, such 
as a chemotherapeutic agent. 

■. Examples of hyperproliferative disorders that can be treated or detected by a 
• polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive systemi liver, pancreas, 
peritoneum, endocrine glands (adrenal,, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated ur detected by 
a polynucleotide or polypeptide of the present invention. Examples of such / 
hyperproliferative disorders include, but are not limited to: 

hypergarnmaglobulinemia, lyrnphopioliferative disorders, paraproteinemias, purpura, V 
sarcoidosis, Sezary Syndrome, Waldenstrori s Macroglobulinemiu, Gaiicher's 
Disease, histiocytosis, and any other hyperproliferative disease, besides neoplasia, 
located in an organ system listed above. 

Infectious Pise^ . ' 

A polypeptide or polynucleotide of the present invention can be used to treat 
or detect infectious agents. For example, by increasing the immune response, 
particularly increasing the proliferation and differentiation of B and/or T cells, , 
infectious diseases may be treated. The immune response may be increased by either 
enhancing an existing immune response, or by initiating a new immune response. 
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Alternatively, the polypeptide or polynucleotide of the present invention may also 
directly inhibit the infectious agent without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus, Adenpviridae, Arenaviridae, Arteri virus, 
Birnaviridae, Bunyaviridac, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpes viridae (such as, Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae. Morbilli virus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picomaviridae, Pox viridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Rctroviridae (HTLV-I, HTLV-II, Lentivirus). and Togaviridae (e.g., 
Rubi virus). Viruses failing within these families can cause a variety of diseases or , 
symptoms, including, but not limited to; arthritis, bronchiollitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox . hemorrhagic fever. Measles, Mumps, 
Parainfluenza. Rabies, the common cold, Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and 
that can be treated or detected by a polynucleotide or polypeptide of the present 
invention include, but not limited to, the following Gram-Negative and Gram-positive 
bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Norcardia), Aspergillosis, Bacillaceac (e.g., Anthrax, Clostridium). . 
Bacteroidaceae, Blastomycosis, Bordetella, Borrelia^ Brucellosis; Candidiasis, 
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatocycoses. 
Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), Erysipelothrix, 
Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales, Neisseriaccae 
(e.g;, Acinetobacter, Gonorrhea, MenigococcaJ), Pasteureliacea Infections (e.g., 
Actinobacillus, Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, 
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Chlamydiaceae, Syphilis, and Staphylococcal. These bacteria] or fungal families can 
cause the following diseases or symptoms, including, but not limited to: bacteremia, 
endocarditis, eye infections (conjunctivitis/tuberculosis, uveitis), gingivitis, 
opportunistic infections (e.g., AIDS related infections), paronychia* prosthesis-related 
infections, Reiter's Disease, respiratory tract infections, such as Whooping Cough or 
Empyema, sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, 
food poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
Diphtheria, Leprosy, Paraniberculosis, Tuberculosis, Lupus, Botulism, gangrene, 
tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin 
diseases (e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound • 
infections. A polypeptide or polynucleotide of the present invention can be used to 
ueat or detect any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, 
Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and 
Trichomonas. These parasites can cause a variety of diseases or symptoms, including, 
but not limited to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., 
dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g.. AIDS 
related), Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or,* 
polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. ' 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could cither be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient supplying the cells with a 
polynucleotide of the present invention, and returning the engineered cells to the 
patient (ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present 
invention can be used as an antigen in a vaccine lo raise an immune response against 
infectious disease. " 

• 'j ' ' ' ' 

Regeneration 
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A'polynucleotide or nolypeptide of the present invention can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 

\ replace, or protect tissue damaged by congenital defects, trauma (wounds, burns, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
reperfus ion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vasculature (including vascular and lymphatics), nervous, hematopoietic, • 

.and skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration 
occurs without or decreased scarring. Regeneration also may include angiogenesis. ■ 

■'. Moreover, a polynucleotide or polypeptide of the present invention may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would quicken recovery time after damage; A 
polynucleotide or polypeptide of the present invention could also be used 

- prophylactically in an effort to avoid damage. Specific diseases that could be treated 
include of tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. A 
further example of tissue regeneration of n on -healing wounds includes pressure 
ulcers, ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 
Similarly, nerve and brain tissue could also be regenerated by using a 

' polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases, associated with peripheral nerve injuries, peripheral 
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous* system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 
Drager syndrome), could all be treated using the polynucleotide or polypeptide of the 
present invention. 
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Chemotaxiis , / 

- A polynucleotide or polypeptide of the present in vention may have 
chemotaxis activity. A chemotaxic molecule attracts or mobilizes cells (eig., 
monocytes, fibroblasts, neutrophils, T^cells, mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body; such as inflammation, infection, or 
site of hypcrproliferatipn. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. ■ 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity'of particular cells. These chemotacuc molecules can then be used 
to treat inflammation, infection, hyperproliferati vc disorders, or any immune system 
disorder by increasing the number of cells targeted to a particular location in the body. - 
For example, chemotaxic molecules can be used to treat wounds and other "trauma to 
tissues by attracting immune cells to , the injured location. Chemotacuc molecules of 
• the present invention can also attract fibroblasts, which can be used to treat wounds. 

It is also contemplated that a polynucleotide or polypeptide of the present 
invention may inhibit chemotactic activity. These molecules could also be used to 
treat disorders. Thus, a polynucleotide or polypeptide of the present invention could 
be used as an inhibitor of chemciaxis. ' '* 

Bindinp A privity 

A polypeptide of the present invention may be used to screen for molecules 
that bind to the polypeptide or for molecules to which the polypeptide binds. The 
binding of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g.. receptors),or 
small molecules. ■ * ■ 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
orruflctionalirjirnetic. (See, Col igan et al.. Current Protocols in Immunology ' . 
l(2):Chapter5 (1991).) Similarly, the molecule can be closely related to the natural * 
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
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of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 
10 , Preferably, the screening for these molecules involves producing appropriate 

cells which express the polypeptide, either as a secreted protein or on the cell 
5 membrane, ^referred cells include cells from mammals, yeast, Drosophila, or£ colt. 
Cells expressing the polypeptide (or cell membrane containingthe expressed 
15 ' . polypeptide) are then preferably contacted with a test compound potentially 

containing the molecule to observe binding, stimulation, or inhibition of activity of 
~ .' either the polypeptide or the molecule. » 
2Q 10 Thc assa y ma > sim pJy test binding of a candidate compound to the ^ 

polypeptide, wherein binding is detectedby a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the ' ' * 
candidate compound results in a signal generated by binding to the polypeptide. 
,25 r Alternatively, the assay can be carried out using cell-free preparations. 

.15 Hypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to, a 
standard. ., ^ . . 1 

Preferably, an ELISA assay can measure polypeptide level or activity in a 
35 sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 1 

antibody can measure polypeptide level or activity by either binding, directly or ' - 
. >. . indirectly, to the polypeptide or by competing with the polypeptide for a substrate: 
All of these above assays can be used as diagnostic or prognostic markers. 
The molecules discovered using these assays can be used to treat disease or to bring . 
about a particular result in a patient (e.g., blood vessel growth) by activating or 
inhibiting the polypeptide/molecule. Moreover, the assays can discover agents which 
: : may inhibit or enhance the production of the polypeptide from suitably manipulated 
. cells or tissues. 

30 Therefore, the invention includes a method of identifying compounds which 

bind to a polypeptide of the invention comprising thc steps of: (a) incubating a 
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candidate binding compound with a polypeptide of the invention; and (b) determining 
if binding has' occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound 
* with a polypeptide of the invention, (b) assaying a biological activity , and (b) 
determining if a biological activity of the polypeptide has been altered. 

Other ActivitiP* 

A polypeptide or polynucleotide of the present invention may. also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. . ' 

A polypeptide or polynucleotide of the present invention may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color., skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 
used to modulate mammalian metabolism' affecting catabolism, anabolism, 
processing, utilization, and storage of energy. ; • * " ' 1 ■ ' " 

A polypeptide or polynucleotide of the present invention may be used to 
change a mammal; s menial state or physical state by influencing biortiythms, 
caricadic rhythms, depression (including depressive disorders), tendency for violence, 
tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-Iike 
activity), hormonal or endocrine levels! appetite, libido, memory, stress, or other 
cognitive qualities. > 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 

Other Purred Fmlm^imnn^ 

Other preferred embodiments of the claimed invention include ah isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% 
identical to a sequence of at least about 50 contiguous nucleotides in the nucleotide 
sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1. 
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Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5* Nucleotide of 
the Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Start Codon and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1 . 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the 5' , 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the 
nucleotide at about the position of the 3* Nucleotide of the Clone Sequence as defined 
for SEQ ID NO:X in Table 1 . 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
seQMer.ce which is at least 95% identical to a sequence of at least about 150 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of 
the First Amino Acid of the Signal Peptide and ending with the nucleotide at about 
the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X 
in Table 1. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence of SEQ ID NO:X. 
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Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic 
acid molecule which hybridizes does not hybridize under stringent hybridization 
conditions to a nucleic acid molecule having a nucleotide sequence consisting of only 
A residues or of only T residues. , 

Also preferred is a. composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. " • 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in the nucleotide sequence of a human cDN A clone identified by a cDNA - 
Clone Identifier in Table 1, which DNA molecule is contained in the deposit given the 
ATCC Deposit Number shown in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of 
at least 50 contiguous nucleotides is included in the nucleotide sequence of the 
complete oztz reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to sequence of at 
least 500 contiguous nucleotides in the nucleotide sequence encoded by said human 
cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
' comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological 
sample a nucleic acid molcculccomprising a nucleotide sequence which is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
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wherein X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
. in the deposit with, the ATCC Deposit Number shown for said cDNA clone in Table 
1 1 which method comprises a step of comparing a nucleotide sequence of at least one 
nucleic , acid molecule in said sample with a sequence selected from said group and 
detennining whether the sequence of said nucleic acid molecule in said sample is at 
least 95% identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample' and a nucleic acid molecule comprising said sequence 
selected from said group. Similarly, also preferred is the above method wherein said : 
step of comparing sequences is performed by comparing the nucleotide sequence 
determined from a nucleic acid molecule in said sample with said sequence selected 
from said group. . The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

A further preferred embodiment is a method for identifying the species, tissue 
or cell type of a biological sample which method comprises a step of detecting nucleic 
acid molecules in said sample, if any, comprising a nucleotide sequence.that is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
wherein X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1. ' • ,. 

The method for identifying the species, tissue or cell type of a biological 
sample can comprise a step of detecting nucleic acid molecules comprising a 
nucleotide sequence in a panel of at least two nucleotide sequences, wherein at least 
one sequence in said panel is at least 95% identical to a sequence of at least 50 , 
contiguous nucleotides in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Tabic 1 , which method comprises a step of detecting in a 
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biological sample obtained from said subject nucleic acid molecules, if any, 
comprising a nucleotide sequence that is at least 95% identical to a sequence of at 
least 50 contiguous nucleotides in a sequence selected from the group consisting of: a " 
nucleotide sequence of SEQ ID NO: X wherein X is any integer as defined in Table 1; 
arid a nucleotide sequence encoded by a human cDNA clone identified by acDNA V- 
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit V * ' 
Number shown for, said cDNA clone in Table 1. • 

v The methodYor diagnosing a pathological condition can comprise a step of 

' detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at 
least two nucleotide sequences, wherein at least one sequence in said panel is at least *. 
;95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from said group. V 

; Also preferred is a composition of matter comprising isolated nucleic, acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise J: 
a panel of at least two nucleotide sequences, wherein at least one sequence in said 
panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a ' 
sequence selected from' the group consisting of: a nucleotide sequence of SEQ ID ' 
NO:X wherein X is any integer as defined in Table 1; and a nucleotide sequence 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Tabic 1 : 

' contained in the deposit with the ATCC Deposit Number shown for said cDNA ' . 
clone in Table I . The nucleic acid molecules can comprise DNA molecules or RNA •/ 
molecules. "_ i : , , /' ' " 

Also preferred is an isolated polypeptide comprising an . amino acid seq uence : J f : 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table 1. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino . , , 

' acids is included in the amino acid sequence of SEQ ID NO; Y in the range of ; 
positions beginning with the residue at about the position of the First Amino Acid of ' 1 ,• 
the Secreted Portion arid ending with the residue at about the Last Amino Acid of the 
Open Reading Frame as set forth for SEQ ID NO: Y hi Table 1. 
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Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID MO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to a sequence of at least about 1 00 contiguous amino 
acids in the amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to die complete amino acid sequence of SEQ ID 
NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 90% identical to a sequence of at least about 10 contiguous amino 
acids in the complete amino acid sequence of a secreted protein encoded by a human 
cDNA cione identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included. in the amino acid sequence of a secreted portion of the secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Tab!e 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous amino acids in 
the amino acid sequence of the secreted portion of the protein encoded by a human 
cDNA clone identified by, a cDNA Clone Identifier in Table I and contained in the 
deposit with the ATCC Deposit Number shown tor said cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide com prising an amino acid sequence 
at least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 



WO 00/06698 PCT/US99/17130 

.' . . ' ' • 30t ' . / 

and contained in the deposit with the ATCC Deposit Number shown for said cDNA 1 
clone in Table 1. 

Further preferred is an isolated antibody which binds specifically to a 
polypeptide comprising an amino acid sequence that is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group \ 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
• 1. ■ , . ' ' " • [ . 

Further preferred is a method for detecting in a biological sample a 
polypeptide comprising an amino acid sequence which is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Tabic 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDN A clone identified by a cDN A Clone Identifier in Table 1 and contained . 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1 ; which method comprises a step of comparing an amino acid sequence of at least 
one polypeptide molecule in said sample with a sequence selected from said group 
and determining whether the sequence of said polypeptide molecule in said sample is 
at least 90% identical to said sequence of at least 10 contiguous amino acids. * 

Also preferred is the above method wherein said step of comparing an amino 
acid sequence of at least one polypeptide molecule in said sample with a sequence 
selected from said group comprises determining the extent of specific binding of < - - 
polypeptides in said sample to an antibody which binds specifically to a polypeptide 
comprising an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: 
an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in 
Table 1 ; and a complete amino acid sequence of a protein encoded by a human cDNA 
■ clone identified by a cDNA Clone Identifier in Tabic ,1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDN A clone in Table 1. 
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Also preferred is the above method wherein said step of comparing sequences 
is performed by comparing the amino acid sequence determined from a polypeptide - 
molecule in said sample with said sequence selected from saM group. 

Also preferred is a method for identifying the species, tissue or cell type of a 
biological sample which method comprises a step of detecting polypeptide molecules 
in said sample, if any, comprising an amino acid sequence that is at least 90% 
identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is 
any integer as defined in Table 1; and a complete amino acid sequence of a secreted , 
protein encoded by: a human cDNA clone identified by a cDNA Clone Identifier in - 
Table I and contained in the deposit with the ATCC Deposit Number shown for said * 
cDNA clone in Table 1; 

Also preferred is the above method for identifying the species, tissue or cell 
type of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a pane! of at least two amino acid, 
sequences* wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. ^ < ■ ' : , ■ > . ' . • 

Also preferred is a method fur diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Table I , which method comprises a step of detecting in a 
biological sample obtained from said subject polypeptide molecules comprising an 
amino acid sequence in a panel of at least two amino acid sequences, wherein at least 
one sequence in said panel is at least 90% identical to a sequence of at least 10 
contiguous amino acids in a sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in, Table 1 and contained in the deposit with the . 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 

In any of these methods, the step of detecting said polypeptide molecules f 
includes using an antibody. 
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Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at 
least 90% identical to a sequence of at least * 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO:Y 
wherein Y is any integer as defined in Table I ; and a complete amino acid sequence 
of a secreted protein encoded by a human cDNA clone identified by a cDNA Clone 
Identifier in Tabic 1 and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide- 
sequence encoding a polypeptide has been optimized for expression of said 
polypeptide in a.prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table 1 : and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Gone Identifier in Table 1 and contained in the deposit with the 
ATCC !>e|>osii Number shown. for said cDNA clone in Table 1 . 

Further preferred is ,a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecule into a vector. Also preferred 
is the recombinant vector produced by this method. Also preferred is a method of 
making a recombinant host cell comprising introducing the vector into a host cell, as 
well as mc recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising 
culturing this recombinant host cell under conditions such that said polypeptide is 
expressed and recovering said polypeptide: Also preferred is this method of making 
an isolated polypeptide, wherein said recombinant host ceil is a eukaryotic cell and 
said polypeptide is a secreted portion of a human secreted protein comprising an 
amino acid sequence selected from the group consisting of: an amino acid sequence of 
SEQ ID NO: Y beginning with the residue at the position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and 
said position of the* First Amino Acid of the Secreted Portion of SEQ ID NO:Y is 
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defined in Table 1 ; and an amino acid sequence of a secreted portion of a protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table I. The isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an 
increased level of a secreted protein activity, which method comprises administering 
to such an individual a pharmaceutical composition comprising an amount of an 
isolated polypeptide, polynucleotide, or antibody of the claimed invention effective to 
increase the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples 

Example 1: Isolation of a Selected cDNA Clone From the Deposited Sam ple 

Each cDNA clone in a cited ATCC deposit is contained in a plasriiid vector. 
Table 1 identifies the vectors used to construct the cDNA library from which each 
clone was isolated. In many cases, the vector used to construct the library is a phage 
vector from which a pJasmid has been excised. The table immediately below 
correlates the related plasmid for each phage vector used in constructing the cDNA 
library. For example, where a particular clone is identified in Table I as being 
isolated in the vector "Lambda Zap," the corresponding deposited clone is in 
"pBluescript."/ 

Vector Used to Construct Library Corresponding Deposited 

Plasmid 

Lambda Zap ' pBluescript (pBS) 

' Uni-ZapXR pBluescript (pBS) 

Zap Express . pBK 

lafmid BA plafmid BA 

pSportl pSpprtl 
pCMVSport 2.0 pCMVSport 2.0 
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pCMVSport-3.0 ' pCMVSport 3.0 ' v 

pCR*2. 1 ' pCR 8 2.I • 

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 
XR (U.S. Patent Nos. 5, 1 28,.256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128.256 and 5,286,636), pBluescript (pBS) (Short, J. M. et a]., Nucleic Acids Res. 
16:7583-7600 ( 1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 
17:9494 (1989)) and pBK (Alting-Mees, M.'A- .etal., Strategies 5:58-61 (1992)) are 
commercially available from Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey 
Pines Road, La Jolla, GA, 92037. pBS contains an ampicillin resistance gene and 
pBK'contains a neomycin resistance gene. , Both can be transformed into E. coli strain 
XL- 1 Blue, also available from Stratagene. pBS comes in 4 forms SK+ f SK-. KS+ 
and KS. The S and K refers to the orientation of the poly linker to the T7 and T3 
primer sequences which flank the poly linker region ("S" is for SacI and "K" is for 
Kpnl which are die first sites on each respective end of the linker). "+" or "-" refer to 
the orientation of the f 1 origin of replication ("ori"), such that in one orientation, 
single stranded rescue initiated from the f 1 pri generates sense strand DMA and in the 
other, antisense. 

Vectors pS port 1, pCMVSport 2.0 and pCMVSport 3.0, were obtained from * 
Life Technologies, Inc., P.O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
contain an ampicillin resistance; gene and may be transformed into E. coli strain 
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et 
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, 
NY) contains an ampicillin resistance gene and can be transformed into E. coli strain 
XL- 1 Blue. Vector pCR?2. 1 1 which is available from Invitrogen, 1600 Faraday 
Avenue, Carlsbad, CA 92008, contains an ampicillin resistance gene and may be 
transformed into E. coli strain DHI0B, available from Life Technologies. (See, for 
instance. Clark, J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., 
Bio/Technology 9: ( 1 99 1 ). ) Preferably, a polynucleotide of the present invention 
does not comprise the phage vector sequences identified for the particular clone in 
Table 1 , as well as the corresponding plasm id vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number 
cited in Table 1 for any given cDNA clone also may contain one or more additional 
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plasrruds, each comprising a cDN A clone different from that given clone. Thus, 
deposits sharing.the same ATCC Deposit Number contain at least a plasmid for each 
cDNA clone identified in Table 1. Typically, each ATCC deposit samplecited in 
Table 1 comprises a mixture of approximately equal amounts (by weight) of about 50 
plasmid DNAs, each containing a different cDNA clone; but such a deposit sample 
may include plasmids for more or less than 50 cDNA clones, up to about 500 cDNA 
clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone in Table 1 . First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ 
ID NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized •'. 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with "P-^ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis ct.al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY ( 1982).). 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL- 1 Blue (Stratagerie)) using techniques known to those of skill in the art, such as 
those provided by the vector supplier or in related publications or patents cited above. 
The u-ansformants are plated on 1.5% agar plates (containing the appropriate selection 
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. 
. These plates are screened using Nylon membranes according to routine methods for 
bacterial colony screening (e.g., Sambrook etal., Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1 .93 to 
T. 104), or other techniques known to those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 
3' NT of the, clone defined in Table .1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain 
reaction is carried out under routine conditions, for instance, in 25 fj.1 of reaction 
mixture with' 0.5 ug of the above cDN A template. A convenient reaction mixture is , 
1 .5-5 mM MgCU 0.0 1 % (w/y) gelatin. 20 |aM each of d ATP. dCTP, dGTP, dTTP. 25 
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pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 
(denaturation at 94°C for 1 min; annealing at 55°C for t min; elongation at 72°Q for 1 
min) are performed with a Perkin-Elrner Getus automated thermal cycler. The 
amplified product is analyzed by agarose gel electrophoresis and the DN A band with 
expected, molecular weight is excised and purified, the PCR product is verified to be 
the selected sequence by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 
include but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well . 
known in the art. For instance, a method similar to 5' RACE is available for 
generating the missing 5' end of a desired full-length transcript. (Fromont-Racine et 1 
a!., Nucleic Acids Res. 2 1(7): 1683-1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a 

. population of RNA presumably containing full-length gene RNA transcripts. A 
primer set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the gene of interest is used to PCR amplify 
the 5' portion of the desired full-length gene. This amplified product may then be . 
sequenced and used to generate the full length gene. 

This above method starts with total RNA isolated from the desired source; 1 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 

: phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged ■ 
RNA which may interfere with the later RNA ligase step, the phosphatase should 
then be inactivated and the RNA treated with tobacco acid pyrophosphatase in order 
to remove the cap structure present at the 5' ends of messenger RN As. This reaction 
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be t 
ligated to an RNA oligonucleotide using T4 RNA ligasc. 

This modified RNA preparation is used as a template for first strand cDN A 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is, 

/used as a template for PCR amplification of the desired 5' end using a primer specific 
to the ligated RNA oligonucleotide and a primer specific to the known sequence of 
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the gene of interest. The resultant product is then sequenced and analyzed to confirm 
that the 5* end sequence belongs to the desired gene. h 

Example 2: Isolation of Genomi c Clones Corresponding to a Polynucleotide 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X„ 
according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 
among others, Sambrook et al. For example, a cDN A probe. produced by the method 
described in Example 1 is labeled with P 32 using the rediprime™ DNA labeling 
system (Amersham Life Science), according to manufacturer's instructions. After 
labeling, the probe is purified using CHROMA SPIN-100™ column (Clontech 
Laboratories, Inc.), according to manufacturer's protocol number PT1200- 1. The 
purified labeled probe is then used to examine various human tissues for mRNA 
expression 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) 
or human immune system tissues (IM) (Clontech) are examined with the labeled 
probe using ExprcssHyb™ hybridization solution (ClonLech) according to 
manufacturer's protocol number PT1 190-1. Following hybridization and washing, the 
blots are mounted and exposed to film at -70 C C overnight, and the films developed 
according to standard procedures. 

Example 4: Chromoso mal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions : 30 seconds, 95°C;,1 minute, 56°C; 1 minute, 70°C. This cycle is 
repeated 32 times followed by one 5 minute cycle at 70°C. Human, mouse, and 
hamster DNA is used as template in addition to a somatic cell hybrid panel containing 
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individual chromosomes or chromosome fragments (Bios, Inc). The reactions is 
analyzed on either 8% poly aery lamide gels or 3.5 % agarose gels. Chromosome 
mapping is determined by the presence of an approximately i 00 bp PCR fragment in 
the particular somatic cell hybrid. * . 

Example 5: Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5* and 3"ends of the DNA ' 
sequence, as outlined in Example 1, to synthesize insertion fragments. Tlie primers 

. used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and Xbal, at the 5' end of the primers in order to clone the amplified product 
into the expression vector. For example, BamHI and Xbal correspond to the 
"restriction enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., 
Chatsworth, C A). This plasmid vector encodes antibiotic resistance (Amp 1 "), a 

, bacterial origin of replication (ori), an IPTG-regulatable promoter/operator,(P/0), a 
ribosome binding site (RBS)/a6-histidine tag (6-His), and restriction enzyme cloning 
sites. 

The pQE-9 vector is digested with BamHI and Xbal and the amplified 
fragment is ligated into the pQE-9 vector maintaining the reading frame initiated at 
the bacterial RBS. The ligation mixture is then used to transform the E. coli strain 
M15/rep4 (Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which 
expresses the lacl repressor and also confers kanamycin resistance (Kan 1 ) 
Transformarils are identified by their ability to grow on LB plates and 
ampicillinAanarnycin resistant colonies are selected. Plasmid DNA is isolated and * 
confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp ( 100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1 : 100 to 1 :250. The 
cells are grown to an optical density 600 (O.D. 6 ^*) of between 6.4 and 0.6. IPTG , 
(Isopropyl-B-D-iliiogalacto pyranoside) is then added to a final concentration of 1 
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mM. IPTG induces by inactivating the lacl repressor, clearing the P/O leading to 
increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
agent 6 Molar Guanidine HCi by stirring for 3-4 hours at 4°C. The cell debris is 
removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
Q1AGEN, Tnc ; supra). Proteins with a 6 x His tag bind to the Ni-MTA resin with 
high affinity and can be purified in a simple one-step procedure (for details see: the 
QIAexpresMonist (1995) QIAGEN, Inc., supra). , . \ 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 
8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then 
washed with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is 
eluied with 6 M guanidine-HCl, pH 5. . 

The purified protein is then renatured by dialyzing it against phosphate- 1 
buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. 
Alternatively, the protein can be successfully refolded while immobilized on the Ni- 
NTA column. The recommended conditions are as follows: renature using a linear 
6M- 1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The renaturation should be performed over a period of 
1 .5, hours or more. After renaturation the proteins are eluted by the addition of 250 
mM irnrhidazole. Immidazole is removed by a final dialyzing step against PBS or 50 ■ 
mM sodium acetate pH 6 buffer plus 200 mM .NaCl. The purified protein is stored at 
4 a CorfTOzen.at-80°C. ' . 

In addition to the above expression vector, the present invention further 
includes an expression vector comprising phage operator and promoter elements 
operatively linked to a polynucleotide of the present invention, called pHE4a. (ATCC ' 
Accession Number 209645, deposited on February 25, 1998.) This vector contains: 
1 ) a neomyc ^phosphotransferase gene as a selection marker. 2) an E. coli origin of 
replication, 3) a T5 phage promoter sequence. 4) two lac operator sequences ♦ 5) a 
Shine-Delgarno sequence, and 6)" the lactose ope ron repressor gene (laclq). The 



WO 00/06698 \ PCT/US99/17130 

' " ; ' 311 ' ■ ' • 

origin of replication (oriC) is derived from pUC!9 (LTI, Gaithcrsburg, MD). The 
promoter sequence and operator sequences are made synthetically. 
r , . , DNA can be inserted into the pHEa by restricting the vector with Ndel and 
Xbal. BamHI, Xhol. or Asp718, running the restricted product oh a gel, and isolating 
the larger fragment (the stuffer fragment should be about 3 10 base pairs). The DNA 
insert is generated according to the PCR protocol described in Example 1. using PCR 
primers having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or 
Asp718 (V primer). The PGR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacteria] system. 

Example 6; Purifica tion of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide 
expressed in E coli when it is present in the form of inclusion bodies. Unless ' 
. otherwise specified, all of the following steps are conducted at 4- 1 0°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled lo 4-10°C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfuidics. Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5 M 
NaCl, 100 inM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are soluhilized with 1.5 M guanidine 
hydrochloride (GuHCl).for 2-4 hours. After 7000 xg centrifugation for 15 min., the 1 
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C . 
overnight to allow further GuHCI extraction. 
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Following high speed centrifugauon (30.000 xg) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5. 150 mM NaCl, 2 mM EDTA. 
by vigorous stirring. The refolded diluted protein solution is kept at 4°C without 
mixing for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0.16 \im membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. t The . 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, -Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.6 and eluted . 
with 250 mM, 500 rnM. 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 280 nm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ- 50, Perseptive Biosystems) and weak 
anion (Poros CM-20, Perseptive Biosystems) exchange resins. The columns are 
equilibrated with 40 rnM sodium acetate. pH 6.0. Both columns are washed with 40 
mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using ' 
a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
acetate, pH 6.0 lo 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are' ' 
collected under constant A :80 monitoring of the effluent. Fractions containing the 
polypeptide (determined, for instance, by 1 6% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the 
c above refolding and purification steps. No major contaminant bands should be 
observed from Cornrnassie blue stained 16% SDS-PAGE gel when 5 M-g of purified 
protein is loaded. The purified protein can also be tested for endotoxin/LPS 
contamination, and typically the LPS content is less than 0.1 ng/mi according to LAL 
■assays. , , . •. 
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Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a 
polynucleotide into a baculovirus to express a polypeptide. This expression vector 
contains The strong polyhednn promoter of the Autographa californica nuclear 
polyhedrosis virus (AcMNPV) followed by convenient restriction sites such as 
BamHI, Xba I and Asp7l8. The polyadenylation site of the simian virus 40 ("SV40") 
is used for efficient polyadenylation. For easy selection of recombinant virus, the 
plasmid contains the beta-galactosidase gene from E. coli under control of a weak 
Drosophila promoter in the same orientation, followed by the polyadenylation signal 
of the polyhedrin gene. The inserted genes are flanked on both sides by viral 
sequences for cell-mediated homologous recombination with wild-type viral DNA to 
generate a viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL94 1, and pAcIMl, as one skilled in the art would readily appreciate, 
as long as the construct provides appropriately located signals for transcription, 
translation,, secretion and the like, including a signal peptide and an in-frame AUG as 
required. Such vectors are described, for instance, in Luckow et al.. Virology 170:3 1 - 
39(1989). 

, Specifically, the cDNA sequence contained in the deposited clone, including 
the AUG initiation codon and the naturally associated leader sequence identified in 
Table 1, is amplified using the PCR protocol described in Example I. If the naturally 
occurring signal sequence is used to produce the secreted protein, the pA2 vector does 
not heed a second signal peptide. Alternatively, the vector can be modified (pA2 GP) 
to include a baculovirus leader sequence, using the standard methods described in 
Summers et al., "A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures/' Texas Agricultural Experimental Station Bulletin No. 1555 
(1987). 

The amplified fragment is isolated from a \% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc.. La Jolla. Ca.)- The fragment 
then is digested with appropriate restriction enzymes and again purified on a 1 % 
agarose gel. * 
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The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine ' 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a . 
commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. culi HB 101 or other suitable E. coli hosts such as XL- 1 Blue 
(Stratagene Cloning Systems. La JoIIa, CA) cells are transformed with the ligation 
* mixture and spread on culture plates., Bacteria containing the plasmid arc identified 
by digesting DNA from individual colonies and analyzing the digestion product by 
gel electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five jig of a plasmid containing the polynucleotide is co-transfected with 1.0 
ug of a commercially available linearized baculovirus DNA ("BaculoGold™ 
baculovirus DNA", Pharmingen, San Diego, CA), using the lipofection method 
described by Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 ( 1987), One jig 
of BaculoGold™ virus DNA and 5 ug of the plasmid are mixed in a sterile well of a 
microliter plate containing 50 |xl of serum- free Grace's medium (Life Technologies 
Inc., Gaithersburg, MD). Afterwards, 10 \il Lipofectin plus 90 u.1 Grace's medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop- wise to Sf9 insect cells (ATCC CRL 171 1) seeded 
in a 35 mm tissue culture plate with 1 ml Grace s medium without serum. The plate is 
then incubated for 5 hours at 27° C. The transfection solution is then removed from 
the plate and I ml of Grace's insect medium supplemented with 10% fetal calf serum • 
is added. Cultivation is then continued at 27° C for four days. 

After four days the supernatant is collected and a plaque assay is performed, 
as described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of ' 
gal-cxpressing clones, which produceblue- stained plaques. (A detailed description of 
a "plaque assay'' of this type can also be found in the user s guide for insect cell 
culture and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 
9-10.) After appropriate incubation, blue stained plaques are picked with the Lip of a 
micropipettor (e.g., Eppenuorf). The agar containing the recombinant viruses is then 
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resuspended in a microcentrifuge tube containing 200 jjJ of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded 
in 35 mm dishes. Four days later the supernatants of these culture dishes are 
^harvested and then they are stored at 4° C. 

To verify the expression of the polypeptide. Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with 
the recombinant baculovirus containing the polynucleotide at a multiplicity of 
, infection ("MOI") of about 2. If radiolabeled proteins arc desired; 6 hours later the 
medium is removed and is replaced, with SF900 II medium minus methionine and 
cysteine (available from Life Technologies Inc., Rockville, MD). After .42 hours, 5. 
jiCi of 33 S- methionine and 5 u;Ci 35 S -cysteine (available from Amersham) are added. 
The cells.are further incubated for 16 hours and then are harvested by centrifugation. 
The proteins in the supernatant as well as the intracellular proteins are analyzed by 
SDS-P AGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be'used to determine the aminb terminal sequence of the 

y ■ _ 

produced protein. 

Example fi: Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a mammalian 
cell. A typical mammalian expression vector contains a promoter element, which 
mediates the initiation of transcription of mRNA. a protein coding sequence, and 
signals required for the termination of transcription and polyadenylation of the 
transcript. Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for RNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from SV40, the long / , 

terminal repeats (LTRs) from Retroviruses, e.g., RSV, HTLV1, HIVI and the early 
promoter of the cytomegalovirus (CMV). However, cellular elements can also be 
used (e.g., the human actin promoter)/ 

> Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2tihtr (ATCC 37J46), pBC12Ml (ATCC 67109), 
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pCMVSport 2.0, and. pCMVSport 3.0. Mammalian host cells that could be used 
include, human Heia, 293, H9 and Jurkat cells; mouse NIH3T3 and C127 cells, Cos l t 
Cos 7 and CVI, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. ' . .' ' . 

Alternatively, the polypeptide can be expressed in stable cell lines containing 
the polynucleotide integrated into a chromosome . The co-transfection with a 
selectable marker such as dhfr, gpt. neomycin, hygromycin allows the identification 
and isolation of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt, F. W., etal.. J. Biol. Chem. 253: 1357-1370 (1978); Hamlin, J. 
L. and Ma, C; Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991 ).) Another useful selection marker 
' is the enzyme glutamine synthase (CiS) (Murphy et al., Biochem J. 227:277-279 
(1991); Bebbingtoh et al., Bio/technology 10:169-175 (1992). Using these markers, 
the mammalian cells are grown in selective medium and the cells with the highest 
resistance are selected. These cell lines contain the amplified gene(s) integrated inura 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. t - > 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors' pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No. 209647) contain' the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al.. Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV enhancer (Bdshart ct al!, Ceil 41:521-530 ( 1985).) Multiple cloning sites, e.g., > 
with the restriction enzyme cleavage sites BamHl, Xbal and Asp718, facilitate the 
cloning of the jgene of interest. The vectors also contain the 3* intron, the 
poJyadenyiation and termination signal of the rat preproinsulin 'gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1 % agarose gel. 
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A polynucleotide of the present invention is amplified according to the 
protocol outlined in Example 1. If the naturally occurring signal sequence is used to 
produce the secreted protein, the vector docs not need a second signal peptide. 
Alternatively , if the naturally occurring signal sequence is not used, the vector can be 
\ 5 modified to include a heterologous signal sequence. (See, e.g., WO 96/34891.) 

The amplified fragment is isolated from a 1% agarose gel using a 
15 commercially available kit ("Geneclean." BIO 101 Inc., La iolla, Ca.). The fragment 

then is digested with appropriate restriction enzymes and again purified on a 1% 
agarose gel. 

10 The amplified fragment is then digested with the same restriction enzyme and 

20 ■ . * 

purified on a 1 % agarose gel. The isolated fragment and the dephosphorylated vector 

are then ligated with T4 DNA ligase. E. colt HB 101 or XL-t Blue cells are then 

transformed and bacteria are identified that contain the fragment inserted into plasmid 

25 . pC6 using, for instance, restriction enzyme analysis. 

15 . Chinese hamster ovary cells lacking an active DHFR gene is used for 

transfection. Five (ig of the expression plasmid pC6 is cotransfected with 0.5 u.g of 

the plasmid pSVneo using lipofectin (Feigner et al M supra). The plasmid pSV2-neo 

30 contains a dominant selectable marker, the neo gene from TnS encoding an enzyme 

that confers resistance to a group of antibiotics including G418. The cells are seeded , 

20 in alpha minus MEM supplemented with 1 mg/ml G4 1 8. After 2 days, the cells are * 

^ trypsinized and seeded in hybridorna* cloning plates (Greiner, Germany) in alpha 

minus MEM supplemented with 10, 25. or 50 ng/ml of metothrexate plus 1 mg/ml 

G41 8. After about 10-14 days single clones are trypsinized and then seeded in 6-weIl 

petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM; 

40 25 100 nM, 200 nM, 400 nM. 800 nM). Clones growing at the highest concentrations of \ 

methotrexate are then transferred to new.6-well plates containing even higher 

concentrations or methotrexate (1 |LlM. 2 fiM, 5 nM, 10 mM. 20 mM). The same 

procedure is repeated until clones are obtained which grow at a concentration of 100 - 

- . 200 |iM. Expression of the desired gene product is analyzed, for instance, by SDS- 

30 PAGE and Western blot or by reversed phase HPLC analysis. 
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Example 9: Protein Fusions 
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The polypeptides of the present invention are preferably fused to other 
proteins. These fusion proteins can be used for a variety of applications. For 
example, fusion of the present polypeptides to His-iag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 5; see 
also EP A 394,827; Traunecker, et al., Nature 33 1:84-86 (1988).) Similarly, fusion to 
IgG- 1, IgG-3, and albumin increases the halflife time in vivo. Nuclear localization ' 
signals fused to the polypeptides of the present invention can target the protein to a ^ 
specific subcellular localization, while covaient hctcrodirner or homodimers can 
increase or decrease the activity of a fusion protein. Fusion proteins can also create 
chimeric molecules having more than one function'. Finally, fusion proteins can ' . ■ 
.increase solubility and/or stability of the fused protein compared to the non-fused 
protein. All of the types, of fusion proteins described above can be made by 
modifying the following protocol, which outlines the fusion of a polypeptide to an 
IgG molecule, or the protocol described in Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PGR amplified, 
using primers that span the 5* and 3' ends of the sequence described below. These 
primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion 
can be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with, 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PGR protocol described in Example I, is ligated into this BamHI site! Note 
that the polynucleotide is cloned without a stop codon, otherwise a fusion protein will * 
not be produced. . 

. If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See. e.g.. WO 96/3489 1.) 



Human IgG Fc region: 
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GGGATCCGGAGCCC.^^TCTTCTGACAAAACTCACACATGCCCACCGTGC 

CCAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCITCCCCCCAA^ 

CCCAAGGACACCGTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGT 

GGT(UG ACGTAAGCCACG A AG ACCCTG AGGTCAACTTC AACTGGTACGTGG 

ACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGGAGTA 

CAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACT 

(XKrTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCGCA 

ACCCCCATCGAGAAAACCATCTGCAAAGCCAAAGGGCAGCCCCGAGAAC 

GACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAG 

GTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGT 

GGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCt 

CCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTG 

GACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCA 

TGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

GTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID NO: 1) 

Example 10: Producti on of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter !)' For example, cells expressing a 
polypeptide of the present invention is administered to an animal to induce the , 
production of sera containing polyclonal antibodies. In a preferred method, a 
preparation of the secreted protein is prepared and purified to render it substantially 
free of natural contaminants. Such a preparation is then introduced into an animal in 
order to produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature 
256:495 (1975); Kohler et al., Eur. J. ImmunoL 6:511 (1976); Kohler eta!., Eur. J. ' 
Immunol. 6:292 (1976); Hammerling et ai.. in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier. N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
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any suitable tissue culture medium; however, ii is preferabie to culture ceils in Earie's 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 
.about 56°C), and supplemented with about 10 g/1 of nonessential amino acids, about 
1 ^000 U/ml of penicillin, and about 100 jig/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma 
cells obtained through such a selection are then assayed to identify clones which 
secrete ami bodies, cap able of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can 
be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the. fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an ami body 
whose ability to bind to the protein-specific antibody can be blocked by the 
polypeptide. Such antibodies comprise anti-idiotypic antibodies to the protein- 
specific antibody and can be used to immunize an animal to induce formation of 
further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present invention may be used according to the methods disclosed 
herein. Such fragments are typically produced by proteolytic cleavage, using 
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 
fragments). Alternatively, secreted protein-binding fragments can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry. ■ 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
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using genetic constructs derived from hybridoma ceils producing the monoclonal 
antibodies described above. ■ iMethods for producing chimeric antibodies are known in 
the art. (See, for review, Morrison, Science 229:1202 (1985); Oi et al., 
BioTechniques 4:214 (1986); Cabilly et ah, U.S. Patent No. 4,816,567; Taniguchi et v 
al., EP 171496; Morrison el id., EP 173494; Neubergeret al, WO 8601533; Robinson 
ct al., WO 8702671; Boulianne et al.. Nature 312:643 (1984): Neubergef et al., Nature 
314:268(1985).) - . < 

Example 1 1 : Production Of Secreted Protein For High.Throuffhput Screening 
Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
Examples 13-20. ' 

First, dilute- Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(tmg/ml in PBS) 1 :20 in PBS (w/o calcium or magnesium 17-516F Biowhittakcr) for 
a working solution. of 50ug/ml. Add 200 ul of this solution to each well (24 well 
plates) and incubate at RT for 20 minutes. Be sure to distribute the solution over each 
well (note: a 1 2-channeI pipetter may be used with tips on every other channel). 
Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered- 
Saline). The PBS should remain in the well until just prior to plating the cells and 
plates may be poly- lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10* cells/well in .5ml 
DMEM(Dulbecco\s Modified Eagle Medium)(wtth 4.5 G/L glucose and L-glutamine 
(12-604FBiowhittaker))/l0% heat inactivated FBS(14-503F BiowhittakerVlx 
Penstrep( 1 7-602E Biowhittaker). Lei the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofcctamine 
( 1 8324-0 1 2 Gibco/B RL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2ug of an 
expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 8 or 9, into an appropriately labeled 96-well round bottom 
plate. With a multi-channel pipetter. add 50ul of the Lipofectamine/Optimem I 
mixture to each well. Pipette up .and down gently to mix. Incubate at RT 15-45 
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minutes. After about 20 minutes, use a multi -channel pipetter to add 150ul Optimcm 
I to each well As a control, one plate of vector DNA lacking am insert should be 
transfected with each set of transfections. 

' 4 Preferably, the trans fecti oh should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too * , 
much time oh PBS. First, person A aspirates off the media from four 24- well plates 
of cells, and then person B rinses each well with .5- 1ml PBS. Person A then aspirates 
off PBS rinse, and person B, using al 2 -channel pipetter with lips on every other 
channel, adds the 200ul of DNA/Lir^fectairune/Optimem I complex to the odd wells 
first, then to the even wells, to each row on the 24- well plates. Incubate at 37°C for 6 
hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in 
DMEM with lx penstrep, or CHO-5 media ( 1 16.6 mg/L of CaC12 (anhyd); 0.00130 
mg/L CuS0 4 -5H 2 0; 0.050 mg/L of Fe(N0 3 ) r 9H 2 0; 0.4 1 7 mg/L of FeS0 4 -7H 2 0; 
31 1 .80 mg/L of Kcl; 28,64' mg/L of MgCU 48.84 mg/L of MgS0 4 ; 6995.50 mg/L of 
NaCl; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L of NaH 3 PO 4 -H 2 0; 71 .02 mg/L of 
Na,HP04; .4320 mg/L of ZnS0 4 -7H 2 0; .002 mg/L of Arachidonic Acid ; 1.022 mg/L 
of Cholesterol; .070 mg/L of DL-alpha-Tocopherol- Acetate; 0.0520 mg/L of Linoleic * 
Acid; 0.010 mg/L of Linolenic Acid: 0.0 10 mg/L of Myristic Acid; 0.010 mg/L of ' 
Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
Pluronic F-68; 0:0 10 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 455 1 mg/L of D- 
Glucose; 1 30.85 mg/ml of L- Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L-Asparagine-H^O; 6.65 mg/ml of L-Aspartic Acid; 29:56 mg/ml of L-Cystirte- 
2HCL-H,0; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0 
mg/ml of L-Glutamine; 1 8.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL- 
H 2 0; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucinc; 163.75 mg/ml of L- 
Lysine HCL; 32.34 mg/ml of L-Methionine;, 68.48 mg/ml of L-Phenylalainine; 40.0 
mg/ml of L-Proline; 26.25 mg/ml of L- Serine; 101.05 mg/ml of L-Threonine; 19.22 - 
mg/ml of L-Tryptophan; 91.79 mg/rril ofL-Tryrosine-2Na-2H : 0; 99.65 mg/ml of L- 
Valine; 0.0035 mg/L of Biotin; 3:24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L of 
Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of 
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319 
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mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 
0.680 mg/L of Vitamin B,,; 25 mM of HEPES Buffer; 2.39 mg/L of Na 
Hypoxanthine; 0.105 mg/L of Lipoic Acid; 0.08 1 mg/L of Sodium Putrescine-2HCL; 
55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM of 
Ewanolamine; 0.122 mg/L of Ferric Citraie; 41,70 mg/L of Methyl-B-Cyclodextrin 
complexed wiih Linoleic Acid; 33.33 mg/L of Methyl -B-Cyclodextrin complexed 
with Oleic Acid; and 10 mg/L of Methyl-B-Cyclodextrin complexed with Retinal) 
with 2mm giutamine and lx pcnstrep. (BSA (8 1-068-3 Bayer) lOOgm dissolved in 1L 
DMEM for a 10% BSAstock. solution). Filter the media and collect 50 ul for 
endotoxin assay in 15mJ polystyrene conical. " . . " 

The transfection reaction is terminated, preferably by tag-teaming, at the end 
of the incubation period. Person A aspirates off the transfection media, while person 
B adds 1 Jml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
, depending on the media used: I %3SA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml 
deep well plate and the remaining supernatant into a 2ml deep well. The supernatants *' 
from each well can then be used in the assays described in Examples 1 3-20! 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the 
polypeptide directly (e.g., as a secreted protein) or by the polypeptide inducing 
expression of other proteins, which are then secreted into the supernatant. ' Thus, the 
invention further provides a method of identifying the protein in the supernatant 
characterized by an activity in a particular assay: . y 

Example 12: Construction of GAS Rep orter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jatcs-STATs pathway. Activated proteins in the 
Jaks-STATs pathway bind to gamma activation site "GAS" elements or interferon- 
sensitive responsive element ("ISRE"). located in the, promoter of many genes. The * 
binding of a protein to these clement^ alter the expression of the associated gene. 

GAS and IS RE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators of Transcription, or "STATs." There are six , 
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members of the STATs family., Statt and Stat3 are present in many ceil types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I, cells after treatment with 
IL- 12. StatS was originally called mammary growth factor, but has been found at 
'. higher concentrations in other cells including myeloid cells. It can be activated in i( 
tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 
upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks t represent a distinct family ofsoluble tyrosine kinases and include Tyk2, . , 
Jak I , Jak2. arid Jak3 . These kinases display significant sequence similarity and arc 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schidler and Darnell, Ann, Rev. Biochem. 64:621- 
51 (1995).).' A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) class I includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-1 1, IL- 
12. IL- 1 5,'Epo, PRL, GH, G-CSF, GM-CSF, LIF f CNTF, and thrombopoietin: and (b) 
Class 2 includes IFN-a, IFN-g, and IL-10. The Class I receptors share a conserved 
cysteine motif (a set of four conserved cysteines and one tryptophan) and a'WSXWS 
motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway/ 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines ' 
are known, to activate the Jaks-STATs pathway! (See Table below.) Thus, by using 
GAS elements linked to reporter molecules, activators of the Jaks-STATs pathway 
can be identified. 
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To construct a synthetic GAS containing promoter clement, which is used in 
trie Biological Assays described in Examples 13-14, a PCR based strategy is 
10 employed to generate a GAS-SV40 promoter sequence. The 5' primer contains four 

' tandem copies of the GAS binding site found in the IRF1 promoter and previously 
5 demonstrated to bind STATs upon induction with a range of cytokines (Rothman et 
al.. Immunity 1:457-468 (1994).), although other GAS or ISRE elements can be used 
15 instead. The 5' primer.also contains 18bp of sequence complementary to the SV40 

early promoter sequence and is flanked with an Xhol site. The sequence of the 5' 
primer is: ' 
10 5 ; :GCGCCTCGAGATTTCCCGG AAATCTAGATTTCCCCGAAATGATTTCGCC 
OA A ATG ATTTCCCCGAAAT ATCTGCGATGTG AATTAG: 3 * (SEQ ID NO:3) 

The downstream primer is complementary to the SV40 promoter and is 
flanked with a Hind III site: 5* :GCGGCA AGCTTTTTGCAA AGCCTAGGC:3' * 
25 (SEQ ID NO:4) 

15 PCR amplification is performed using the SV40 promoter template present in 

the B -gal: promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol/Hind III and subcloncd into BLSK2-. (Stratagene.) Sequencing . 
with forward and reverse primers confirms that the insert contains the following 
sequence: ' x ' ' . _ t 

20 S^ CTCGAG ATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAA 
35 TGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCG 
CCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCT 
CCGCCCCATGGCTGACTAAlTri riTTATTTATGCAGAGGCCGAGGCCGCC 
TCGGCCTCrGAGCTATrCCAGAAGTAGTGAGGAGGCTTri , 'lTGGAGGCCT 
40 r 25 AGGCTTTTGC A A A A^GCTEE" 3 ' (SEQ ID NO:5) 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered.' Here, the reporter molecule is a secreted 
45 alkaline phosphatase, or ,f SEAP." Clearly, however, any reporter molecule can be 

instead of SEAP r in this- or in any of the other Examples. Well known reporter 
30 molecules that can be used instead of SEAP include chloramphenicol 

acetyitransferase (CAT) , luciferase,*al kaline phosphatase, B-gaiactosidase, green 
50 fluorescent protein (GFPK or any protein detectable by an antibody. 
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The above sequence confirmed synthetic GAS-SV40 promoter. element is 
subcloned into the pSEAP-Prbmoier vector obtained from Clontech using Hindm and 
Xhql, effectively replacing the SV40 promoter with the amplified GAS:Sy40 
promoter element, to create the GAS-SEAP vector. However, this vector does not 
contain a neomycin resistance gene, and therefore, is not preferred for mammalian 
expression systems. . V; 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using ; 
Sail and NotI, and inserted into a backbone vector containing the neomycin resistance T 
gene, such as pGFP- 1 (Clontech); using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NFK-B and EGR promoter sequences are described in Examples 15 and 
16. However, many other promoters can be substituted using the protocols described 
in these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g.. GAS/NF-KB/EGR, GAS/NF-KB, II- 
2/NFAT, or NF- KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial); Reh (B-celi), 
Saos-2 (osteoblast), HU VAC (aortic), or Cardiomyocyte. 

- »•'''■ i • 

Example 13: High-Throughput Screening Assay for T-cell Activity 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate T-cells . T- 
cell activity is assessed using the GAS/SEAP/Neo, construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-, 
STATS signal transduction pathway. The T-cell used in this assay is Jurkat T-cells 
(ATCG Accession No. TIB-152), although Molt- 3 cells (ATGC Accession No. CRL- 
1552) and Molt-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 
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■ Jurkat T-cetls are lymphoblastic CD4+ Th 1 helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are trans fected with the GAS- , 
SEAP/neo vector using DMRIE-C (Life Technologies )(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and trans fectants resistant to 1 mg/ml gemicih selected; 
Resistant colonies are expanded and then tested for their response to increasing 
concentrations of interferon gamma. The dose response of a selected clone is < 
demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to - 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in 
RPMI + 10% serum with i.% Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life 
Technologies) with 10 ug of plasmid DNA in a T25 flask. Add 2.5ml OPTI-MEM 
containing 50 ul of DMRIE-C and incubate at room temperature for 1 5-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells ( 1Q 7 per trans fee ti on), and resuspend in OPTI-MEM to a final 
concentration of I0 7 cells/ml. Then add litii of 1 x 10 7 ceils in OPTI-MEM to T25 
flask and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI -t- 15% \ 
serum. . 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Genticin, and J% Pen-Strep. These cells are treated with . *■ ' 

supernatants containing a polypeptide as produced by the protocol described in 
Example 11. 

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 1 0% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supernatants 'being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 ., 
million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells 
into a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 
200 ul of cells into each well (therefore adding 100, 000 cells per well). 
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After all the plates have been seeded. 50 ul of the supematants are transferred 
directly from the 96 well plate containing the supematants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma .(0. 1 , t .0, 10 ng) 
is added to weils H9, HI0, and HI 1 to serve as additional positive controls for the 

5 . assay. 

The 96 well dishes containing Jurkat cells treated with supematants arc placed 
in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul 
samples from each well are then transferred to an opaque 96 well plate using a 12 
channel pipette. The opaque plates should be covered (using sellophene covers) and 
10 stored at -20 o C until SEAP assays are performed according to Example 17. The 4. , 
plates containing the remaining treated cells are placed at 4VC and.serye as a source 
of material for repeating the assay on a specific well if desired: 

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T ceils. Over 30 fold induction is typically observed in the 1 
15 positive control wells. 

The above protocol may be used in the generation of both transient, as well as, 
stable trans fected cells, which would be apparent to those of skill in the art. 

Example 14: High-Throughput Screening Assay Identifying Myeloid Activity ; 

20 , . The following protocol is used to assess myeloid activity by identifying 

factors, such as growth factors and cytokines, that may proliferate or differentiate 
myeloid cells. Myeloid cell activity is assessed using the,GAS/SEAP/Neo construct 
* produced in Example 12. Thus, factors that increase SEAP activity indicate the 
ability to activate the Jaks-STATS signal transduction pathway. The myeloid cell ' 
25 used in this assay is U937, a pre-monocyte cell line, although TF- 1 , HL60, or KG 1 ' . 
can be used. 

To transiently transfect U937 cells with thc.GAS/SEAP/Neo construct * 
45 , produced in Example 12, a DEAEDextran method (Kharbanda et. al., 1994, Cell 

. Growth & Differentiation, 5:259-265) is used. First, harvest 2xi0c 7 U937 cells and 
30 ' wash with PBS. The U937 cells are usually grown in RPMI 1640 medium containing' 
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10% hcat-inactivaled feul bovine serum (FBS) supple mented with 100 units/ml 
penicillin arid 100 mg/tnl streptomycin. ' , 

Next suspend the ceils in 1 ml of 20 mM Tris-HCI (pH 7.4) buffer containing 
0.5 mg/mi DEAE-Dcxtran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM ' 
KCI. 375 uM Na2HPO 4 .7H20, 1 mM MgCl2, and 675 uM CaCI 2 - v Incubate at 37°C , 
for 45 mini 

Wash the cells with RPMI 1640 medium containing 10% FBS and then " ' . 
resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

/ The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/mi G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting ix 10 H cells (this is enough for ten 96- well 
plates assay) and wash with PBS. Suspend the cells in 200 ml above described 
growth medium, with a final density of 5x10* cells/ml. Plate 200 ul cells per .well in 
the 96-well plate (or 1x10 s cells/well). \ 

Add 50 ul of the supernatant prepared by the protocol described in Example 
1 1 ... Incubate at 37°.C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon i - 
gafruna can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant 
according to the protocol described in Example 17. , 

Example 15: High-Throughput Screening Assay Identifying Neuronal Activity. 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these genes, 
EGR1 (early growth response gene I), is induced in various tissues and cell types 
upon activation. The promoter of EGR1 is responsible for such induction. Using the 
EGR1 prompter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC 12 cells (rat phenochrornocytbma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TP A (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). 
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The EGRl gene expression is activated during this treatment. Thus, by stably 
transfecting PCI 2 cells with a construct containing an EGR promoter linked to SEAP 
reporter, activation of PC 12 cells can be assessed. 

The EGR/SEAP reporter construct can be. assembled by the following 
protocol. The EGR-1 promoter sequence (-633 to +l)(Sakamoto K el al., Oncogene 
6:867-871 (1991)) can be PCR amplified from human genomic DN A using the 
following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3* (SEQ ID NO:6) 
5*. G CG A AGCTTCGCGACTCCCCGG ATCCG CCTC- 3' (SEQ ID NO:7) 
» Using the GAS:SEAP/Neo vector produced in Example 12, FGR1 amplified 
product can then be inserted into this vector. Linearize the GAS:SEAP/Nco vector 
using restriction enzymes Xhol/Hindlll, removing the GAS/SV40 staffer. Restrict the 
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1 
promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1:30 
dilution of collagen type f (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-weil plate, and 
allowed to air dry for 2 hr. 

PC 12 cells are routinely grown in RPMI- 1640 medium (Bio Whittaker) 
containing 10% horse serum (J RH BIOSCIENCES, Cat. # 12449-78P), 5% heat- .• t 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 
100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is 
done every three to four days. Cells arc removed from the plates by scraping and 
resuspended with pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC 12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 ug/ml G418. The G418-frcc medium is used for routine * 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G4 18 
for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
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(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1% horse serum and 0.5% FES with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 

the cell number and add more low serum medium to reach final cell density as 5x10^ 
cells/ml. 

Add 200 ul of the cell suspension to each well of 96- well plate (equivalent to 
lx 10 5 cells/well). Add 50 ul supernatant produced by Example 1 1, 37°C for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 ceils through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF); Over fifty-fold * 
induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

Example 16; High-Throughput Scre ening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide 
variety of agents including the inflammatory cytokines IL- 1 and'TNF, CD30 and 
CD40, lymphotoxin^alpha and lyraphotoxin-bcta, by exposure to LPS or thrombin, 
and by expression of certain viral gene products. As a transcription factor, NF-kB 
regulates the expression of genes involved in immune cell activation, control of 
apoptosis (NF- kB appears to shield ceils from apoptosis), B and T-cell development, 
anti- viral and antimicrobial responses ; and multiple stress responses. * 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with I-kB 
(Inhibitor kB). However, upon stimulation, I- ,kB is phosphorylated and degraded, 
causing NF- kB to shuttle to the nucleus, thereby activating transcription of target 
genes. Target genes activated by NF- kB include IL-2, 1L-6, GM-CSF, ICAM-t and 
class 1 MHC. \ ' 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supernatants 
produced in Example 1 1. Activators or inhibitors of NF-kJB would be useful in 
treating diseases. For example, inhibitors of NF-kB could be used to treat those 
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diseases related to ihe acute or chronic activation of NF-kB, such as rheumatoid ' 
arthritis. 

To construct a vector containing the NF-kB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of theNF-KB 
binding site (GGGGACTTTCCC) (SEQ ID NO:8). 18 bp of sequence complementary 
to the 5' end of the SV40 early promoter sequence, and is flanked with an Xhol site: 
5 ' :GCGGCCTCG AGGGG ACTTTCCCGGGG ACTTTCCGGGG ACTTTCCGGGXc 
TTTCCATCCTGCCATCTCAATTAG:3 (SEQ ID NO:9) 

The downstream primer is complementary to the 3' end of the SV4C promoter 
and is flanked with a Hind HI site: . ■'. ■ 

5^'GCGGCAAGCTTTTTGCAAAGCCTAGGC;3 , (SEQ ID NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
the pB-gal: promoter plasmid obtained from Clontcch*. The resulting PCR fragment is . 
digested with Xhol and Hind HI and subcloned irito BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence:* 

5' :CTCG AGGGG ACTTTCCCGGGG ACTTTCCGGGG ACTTTCCGGGACTTTCC 
ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCC 
ATCCCGCCCCTAACTCCGCCCAGTtCCGCCCAtTCTCCGCCCCATGGCTGA 
CT A A 1 ' 1 ' 1 T 1 1 T " 1 ATTTATGC AG AGGCCG AGGCCG CCTCGGCCTCTG AGCT A 
TrCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCA^ 
GCTT:3* (SEQ ID NO: 10) 

Next, replace the SV40 minimal promoter element present in the pSEAP2- 
promoter plasmid (Clontcch) with this NF-kB/S V40 fragment using Xhol and 
Hindi II. However, this vector does not contain a neomycin resistance gene, and 
therefore .is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NFtKB/SV40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes 
Sail and NotI, and inserted into a vector containing neomycin resistance. Particularly, 
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the NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the 
GFP gene t after restricting pGFP-I with Sail and Not!. 
10 Once NF-icB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are 

' created and maintained according to the protocol described in Example* 13. Similarly, 

, 5 the method for assaying supernatants with these stable Jurkat T-cells is also describee! 

in Example 13. As a positive control, exogenous TNF alpha (0,1,1, 10 ng) is added to 
wells H9, H10, and HI i, with a 5-10 fold activation typically observed. 
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; . Example 17: Assay for SEAP Activity 
20 10 As a reporter molecule for the assays described in Examples 13-16, SEAP. 

activity is assayed using the Tropix Phospho-Iight Kit (Cat. BP-400) according to the 
following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used below. 
25 , Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 jil of 2.5x 

15 dilution buffer into Optiplates containing 35 u.1 of a supernatant. Seal the plates with 
a plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid 

uneven heating. * ' • 

30 . . ' 

' Cool the samples to room temperature for 15 minutes: Empty the dispenser 

and prime with the Assay Buffer. Add 50 uj Assay Buffer and incubate at room 
20 temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
35 table below). Add 50 fil Reaction Buffer and incubate at room temperature for 20 

s minutes. Since the intensity of the chcmiluminescent signal is time dependent, and it 

takes about 10 minutes to read 5 plates on lurni no meter, one should treat 5 plates at 
each time and start the second set 10 minutes later. 

40 

25 Read the relative light unit in the Juminometer. Set H 12 as blank, and print 

* the results. An increase in chemiluminescence indicates reporter activity. 

45 Reaction Buffer Formulation: ; 



50 



# of plates Rxn buffer diluent (mi) . CSPD(ml) 

lp .- ■ 60 ~ ~ _ 3 

U . 65 t 3.25 

12 ■■■ ' 70 3.5' 
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Example 18: High-Throughput Scre ening Assay Identifying Changes in Small 
Molecule Concentration and Mem brane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium, sodium/ and pH, as well as alter membrane - 
potential. These alterations can be measured in an assay to identify supeniatants • 
which bind to receptors of a particular cell. Although the following protocol 
describes un assay for calcium, this protocol can easily be modified to detect changes 
in potassium, sodium, pH, membrane potential, or any other small molecule which is 
detectable by a fluorescent probe. 

The following assay uses Fluororhetric Imaging Plate Reader ("FLIPR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium' fluorescent molecule, fluo-4 (Molecular Prcbes, inc. ; , 
catalog no. f-14202) , used here. ^ , 

. ' For adherent cells, seed the cells at 10,000 r20,000 cells/well in a Co-star 
black 96-well plate with clear bottom, the plate is incubated in a CO, incubator for 
20 hours. The adherent cells are washed two times in Biotek washer with 200 ul of 
HBSS (Hank's Balanced Salt Solution) leaving 100 ul of buffer aftcrthe final wash. 

A stock soliiuon of 1 mg/ml fluo-4 is made in 10% plurohic acid DMSO. To ■ 
load the cells with fluo-4 ,50 ul of 12 ug/ml fluo-4 is added to. each well. The plate 
is incubated at 37°C in a C0 2 incubator for60 rain. The plate is washed four times in' ■ 
the Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5xl0 6 ceils/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 
fluo-4 solution in 10% pluronic acid DMSO is added toeach ml of cell suspension. 
The tube is then placed in a 37°C water bath for 30-60 niin. The ceils are washed 
twice with HBSS , resuspended to 1 x 1 0 6 cells/ml, and dispensed into a microplate, 1 00 
ul/well: The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed 
once in Denley CellWash with 200 ul, followed by an aspiration step to 1 00 ul final 
volume. ■ 



WO 00/06698 • PCT/US99/17130 

" ., ' 338 1 • ' ' '• 

For a non-cc 11 based assay, each well contains a fluorescent molecule; such as 
fluo-4 . The supernatant is added to the well, and a change in fluorescence is 
detected. ' ' / ' 

' To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stopis F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; 
and (6) Sample addition is 50 ui. Increased emission at 530 nm indicates an 1 
extracellular signaling event which has resulted in an increase in the intracellular 
Ca"*" 1 " concentration. ' . 

Example 19: High-Throughput Screening Assay Ide n tifying Tyrosine Kinase 
Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine 
Kinase RPTK) group are receptors for a range'of mitogenic and metabolic growth 
factors including the PDGF. FGF, EGF,' NGF. HGF-and Insulin receptor subfamilies. 
In addition there are a large family of RPTKs for which the corresponding iigand is 
unknown. Ligands for R?TKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by. ligands involves ligand-mediated receptor . . 
dimerizaiion, resulting in transphosphorylation of the receptor subunits and activation 
of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include 
receptor associated tyrosine kinases of the src-family (e.g., sre, yes. lck, lyn, fyn) and 
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, 
members of which mediate signal transduction triggered by the cytokine superfamily 
of receptors (e.g., the Intcrlcukins, Interferons. GM-CSF, and Leptin); 

. -Because of the wide range of known factors capable of stimulating tyrosine 
kinase activity, the identification of novel human secreted proteins capable of 
activating tyrosine kina.se signal transduction pathways arc' of interest. Therefore, the 
following protocol is designed to identify those novel human secreted proteins 
capable of activating the. tyrosine kinase signal transduction pathways. • 



WO 00/06698 PCT7US99/17130 

■'■ •• • • 339 ' ' ;.. . 

Seed target pells (e.g., primary keratinocytes) at a density of approximately 
25,000 ceils per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 
100% ethahol, rinsed with water and dried overnight. Some plates are coated for 2 hr 
with 100 ml of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or 
polylysine (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. 
Louis, MO) or 10% Matrigcl purchased from Becton Dickinson (Bedford,MA), or 
calf serum, rinsed with PBS and stored at 4°C. Cell growth on these plates is assayed . 
by seeding 5,000 cells/well in growth medium and indirect quantitation of cell 
number through use of alamarBluc as described by the manufacturer Alamar 
Biosciences, Inc. (Sacramento, CA) after 48 hr. Falcon plate covers #307 1 from f 
■ Becton Dickinson YBedrbrd,M A) are used to cover the Loprodyne Silent Screen 
Plates. Falcon Microtest III eel] culture plates can also be used in some proliferation 
experiments. • .. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 
Loprodyne plates (20.000/200ml/well) and cultured overnight in complete medium. 
Cells are quiesced by incubation in serum-free basal medium for 24 hr; After 5-20 ^ 
minutes treatment with EOF (60ng/ml) or 50 ul of the supernatant produced in 
Example 1 1, die medium was removed and 100 ml of extraction buffer ((20 mM 
HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM 
Na4P207 and a cocktail of protease inhibitors (# 1836 1 70) obtained from 
Boeheringer Mannheim (Indianapolis, IN) is added to each well and the plate is 
shaken on a rotating shaker for 5 minutes at 4°C. The plate is then placed in a 
vacuum transfer manifold and the extract filtered through the 0.45 mm membrane 
bottoms of each well using house vacuum. Extracts are collected in a 96-well ■ 
catch/assay plate in the bottom of the vacuum manifold and immediately placed on 
ice. To obtain extracts clarified by centrifugation. the content of each welt, after 
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 
4°Cat 16,000 xg. 
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Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described 
here. f .-, 

Generally, the tyrosine kinase activity of a supernatant is evaluated by 1 1 
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biotinylated peptides that can be used for this purpose include 
PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding rhe following components in 
order: First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2 + (5mM 
ATP/50mM MgCl2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, E < 
pH7.3, 40 mJM bcta-glycerophosphate, ImM EGTA. lOOrriM MgCl^ 5 mMMnCl2, 
0.5 mg/ml BSA), then 5ul of Sodium Vanadate( 1 mM), and then 5ul of water. Mix the 
components gently and preincubate the reaction mix at 309C for 2 min. Initial the 
reaction by adding IOuI of the control enzyme or the filtered supernatanL 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 
1 20mm EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by, transferring 50 ul aliquot of reaction 
mixture to a microtitcr plate (MTP) module and incubating at 37°C for 20 min. This 
allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. . 
Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 

POD(0.5u/ml)) to each well and incubate at 37°C for one hour. Wash the well as 
above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absorbance of the sample at 405 nm by using ELIS A reader. The level of bound 
peroxidase activity is quantitareri using an ELIS A reader and reflects the level of 
tyrosine kinase activity. 1 
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EXMPp j c 20: High-ThrOMPhPUt Screening Assay Identi fying Phrep hA^fr^ 
Activity 

As a potentiaJ alternative arid/or compliment to the assay of protein tyrosine 
kinase activity described in Example 19, an assay w 

(phosphorylation) of major intracellular signal transduction intermediates can also be 
used. For example, as described below one particular assay can detect tyrosine, 
phosphorylation of the Erk-1 and Erk-2 kinases. . However, phosphorylation of other 
molecules, such as Rat JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 
Sic, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosine. or phosphothreohine molecule, can be detected by 
substituting these molecules for Erk-1 or Erk-2 in the following assay. 

Specifically, assay, plates are made by coating the wells of a 96-well EUSA 
plate with 0. 1ml of protein G (lug/ml) for 2 hi at room temp, (RT). The plates are 
then rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G 
plates are then treated with 2 commercial monoclonal antibodies ( 1 OOng/well) against 
Erk-1 ■ : . . " ; 

and Erk-2 (1 hr at RT) (Santa Cni2 Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of 
the above described molecules.) After 3-5 rinses with PBS, the plates are stored at 
4°C until use. . 

A43 1 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the supernatants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilizcd and extracts 
filtered directly into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As 
a positive control, a commercial preparation of MAP kinase ( lOng/well) is used in 
place 

of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) 
antibody (lug/ml) which specifically recognizes the phosphorylated epitope of the 
Erk-1 and Erk-2 kinases ( 1 hr at RT). This antibody is biotihylated by standard 
procedures. The bound polyclonal antibody is then quantitated by successive , 
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incubations with Europium-streptavidin and Europium fluorescence enhancing 
reagent in the Wallac DELFIA instrument (time-resoived fluorescence). An increased 
fluorescent signal over background indicates a phosphorylation. 

Example 2 1 ; Method of Determining Alteration s in a Gene Corresponding tn a 
Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenorypc of interest (such as a disease) is be isolated. cDNA' is then generated from 
these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA 
is then used as a template for PCR, employing primers surrounding regions of interest 
in SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30 
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C using buffer 
solutions described in Sidransky, D.. et al., Science 252:706 (1991). 

PCR products are then* sequenced using primers labeled at their 5" end with T4 
polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre 
Technologies). The intron-exon borders of selected exbns is also determined and 
genomic PCR products analyzed to confirm the results. PCR products harboring 
suspected mutations is then cloned and sequenced to validate the results of the direct 
sequencing. ■ „ 

PCR products is cloned into T-tailed vectors as described in Holton, T.A. and 
Graham, M.W., Nucleic Acids Research, 19: 1 156 ( 1 99 1 ) and sequenced with T7 
polymerase (United Slates Biochemical). Affected individuals are identified by 
mutations hot present in unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 
according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5'- 
triphosphate (Boeliringer Manheim), and FISH performed as described in Johnson. 
Cg. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe 
is carried out using a vast excess of human cot- 1 DNA for specific hybridization to 
the corresponding genomic locus. 

Chromosomes are counterstained with 4.6-diamino-2-phenylidoIe and 
propidium iodide, producing a combination of C- and R-bands. Aligned images for 
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precise mapping arc obtained using ^a triple-band fiiter set (Chroma Technology, 
Brattleboro, VT) in combination with* a cooled charge-coupled device camera 
(Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv. 
et ai.. Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and 
chromosomal Fractional length measurements are performed using the ISee Graphical 
Program System. (Inovisioh Corporation, Durham, NC.) Chromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 
translocations. These alterations are used as a diagnostic marker for an associated 
disease, " ' ' . ' 

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in » 
Biological Sample ,t 

A polypeptide of the present invention can be detected in a biological sample, 
and if an increased or decreased level of the polypeptide is detected, this polypeptide 
is a marker for a particular phenotype. Methods of detection are numerous, and thus, 
it is understood that one skilled in. the art can modify the following assay to fit their 
particular needs. . 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microti ter plate are coated with 
specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are 
either monoclonal or polyclonal and are produced by the method described in 
Example 10. The wells are blocked so that non-specific binding of the polypeptide to 1 
the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used 
to validate results.' The plates are then washed three times with deionized or distilled 
water to remove unbounded poly peptide. , ■ * 

Next, 50 ul of specific antiboily : alkaline phosphatase conjugate, at a 
concentration of 25-400 ng f is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
unbounded conjugate. 
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Add 75 ul of 4-methylumbelliferyl phosphate (V1UP) or p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. . Measure the reaction by a microliter plate reader Prepare a standard 
curve, using serial dilutions of a control sample, and plot polypeptide concentration • 
on the X-axis (Jog scale) and fluorescence' or absorbance of the Y-axis (linear scale). 
Interpolate the concentration of the polypeptide in the sample using the standard 
curve. . : • ' . , 

Example 23: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
condition of the individual patient (especially the side effects of treatment with the 
secreted polypeptide alone), the site of delivery, the method of administration, the 
scheduling of administration, and other factors known to practitioners! .The "effective . 
amount" for purposes herein is thus determined by such considerations. ; 

As a general proposition, the total pharmaceutically effective amount of 
secreted polypeptide administered parenterally per dose will be in the range of about 1 
Jlg/kg/day to 10 mg/kg/day of patient body weight, although, as noted above, this will " 
be subject to therapeutic discretion. More preferably, this dose is at least 0.01 
mg/kg/day, and most preferably for humans between about 0.01 and 1 mg/kg/day for 
the hormone. If given continuously, the secreted polypeptide is typically 
administered at a dose rate of about I ug/kg/iiour to about 50 ptg/kg/hour, either by 1- 
4 injections per day or by continuous subcutaneous infusions, for example, using a 
mini-pump. An intravenous bag solution may also be employed. The length of 
treatment needed to observe changes and the interval following treatment for 
responses to occur appears to vary depending oh the desired effect. ■ 

Pharmaceutical compositions containing the secreted protein of the invention 
arc administered Orally, rectally, parenteraJly, intracistemally, intravagihally, 
intraperitoneajjy, topically (as by powders, ointments, gels, drops or transdermal 
patch), hucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" 
refers to a nontoxic solid, semisolid or liquid filler, diluent, encapsulating material or 
formulation auxiliary of any type. The term "parenteral" as used herein refers to 
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modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrastemal. subcutaneous and intraarticular injection and infusion. 

The secreted polypeptide is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- f 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained- release matrices include polylactides (U.S. Pat. No, 
3;773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate ' 
(Stdman, U. et al.. Biopolyuiers 22:547-556 (1983)), poly (2- hydroxyethyl 
methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 15: 1 67-277 (1981). and R. 
Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.) or 
poly-D- (-)~3-hydroxybutyric acid (EP 133,988). Sustained-release compositions 
also include liposomally entrapped polypeptides. Liposomes containing the secreted 
polypeptide are prepared by methods known per se: DE 3,2 18, 12 1; Epstein et al., . 
Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang et al., Proc. Nad. Acad, Sci. 
USA 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP : 
142,641; Japanese Pat. Appl. 83-1 18008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and 
EP 102,324.. Ordinarily, the liposomes are of the small (about 200-800 Angstroms) 
'.niiamel-'ar type in which the lipid content is greater than about 30 mol /percent . 
cholesterol, the selected proportion being adjusted for the optimal secreted 
polypeptide therapy. 

For parenteral administration, in one embodiment, the secreted polypeptide is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically 
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the formulation. 
For example, the formulation preferably does not include oxidizing agents and other 
compounds that are known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably the 
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood 
of the recipient. Examples of such carrier vehicles include water, saline. Ringer's 
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solution, and dextrose solution. Non-aqueous vehicles such as fixed ojls and ethyl '.. 
oleate are also useful herein, as well as liposomes; 

The carrier suitably contains minor amounts of additives such as substances 
that enhance isotonicity and chemical stability. Such materials are non-toxic to 
recipients at the dosages and concentrations employed, and include buffers such as 
phosphate, citrate, succinate, acetic acid, and other organic acids or their salts; 
antioxidants such' as ascorbic acid; low molecular weight (less than about ten 
residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as . 
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspanic acid, or 
arginine; monosaccharides, disaccharides, and other carbohydrates including cellulose ' 
or its derivatives, glucose, manose, or dextrins; chelating agents such as EDTA; sugar * 
alcohols such as mannitol or sorbitol; counterions such as sodium; and/or nonionic 
surfactants such as polysorbates. poloxamers, or PEG. 

The secreted polypeptide is typically formulated in such vehicles at a 
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 nig/ml, at a pH of 
about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes 
(e.g., 0.2 micron membranes). Therapeutic polypeptide compositions generally are 
placed into a container having a sterile access port, for example, an intravenous 
solution bag or vial having a stopper pierceable by a hyriodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers,' for - 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 
formulation for recoristitution. As an example of a lyophilized formulation, 10-rhl 
vials are filled with 5 ml of sterile-filtered 1 9c (w/v) aqueous polypeptide solution, . 
and the resulting mixture is lyophilized. The infusion solution is prepared by 
reconstituting the lyophilized polypeptide using bacteriostatic Water.for-Injection, 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of ihe ingredients of the pharmaceutical 
compositions of the invention. Associated with such containers) can be a notice in 
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the form prescribed by a governmental agency regulating the manufacture, use or sale 
of pharmaceuticals or biological products, which notice reflects approval by the 
10 agency of manufacture, use or sale for human administration. In addition, the 

polypeptides of the present invention may be employed in conjunction with other 
5 therapeutic compounds. 

15 Example 24: Method of Treating Decreased Levels of the Polypeptide 

It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by' 
20 ' ' 10 administering the polypeptide of the present invention, preferably in the secreted 

form, thus, the invention also provides a method of treatment of an individual in f 
need of an increased level of the polypeptide comprising administering to such an 
individual a pharmaceutical composition comprising an amount of the polypeptide to 
25 increase the. activity level of the polypeptide in such an individual. 

15 For example, a patient with decreased levels of a polypeptide receives a daily 

dose 0.1- 100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
ad^inistraticr. and fcrrr.ulation, are provided in Example 23. 
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20 Example 25; Method of Treating Increa sed Levels of the Polypep tide 

35 < Antisense technology is used to inhibit production of a polypeptide of the 

present invention. This technology is one example of a method of decreasing. levels 
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such as 
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cancer. 



40 _ 

±5 For example, a patient diagnosed with abnormally increased levels of a 

polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 

2.0 and 3.0 mg/kg day for 21 days, this treatment is repeated after a 7-day rest 

period if die treatment was well tolerated: The, formulation of the antisense 

polynucleotide is provided in Example 23. v 
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Example 26; Method of Treatment Using Gene Therapy 
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One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Smafl c hunks of the tissue are placed on a wet surface of 
a tissue culture flask, approximate iy ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10%'FBS, 
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for 
approximately one week. ' . 

At this time, fresh media is added and subsequently changed every several 
days. After an additional two. weeks in culture, a monolayer of fibroblasts. emerge. 
The monolayefis trypsinized and scaled into larger flasks. 

pMV-7 (KJrschmeier, P.T et al., DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and - 
Hindlll and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. ■ 

. The cDNA encoding a polypeptide of the present invention can be amplified 
using PCR primers which correspond to the 5' and 3' end sequences respectively as set 
forth in Example 1 . Preferably, the 5* primer contains an EcoRI site and the 3* primer 
includes a Hindlll site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and Hindlll fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions . 
appropriate, for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB 101 . which arc then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 

The amphotropic pA3 1 7 or GP+am 1 2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 
10% calf serum (CS), penicillin and streptomycin. The MSV vector containing the 
gene is then added to the media and the packaging cells transduced with the vector. 
The packaging cells now produce infectious viral particles containing the gene, (the 
packaging cells are now referred to as producer cells). 
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Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
t containing the infectious' viral particles, is filtered through a milliporc filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the 
media from the producer cells. This media is removed and replaced with fresh media. 
If the titer of virus is high, then virtually all. fibroblasts will be infected and no 
selection is required. If the titer is very low, then it is necessary to use a retroviral 
vector that has a selectable marker, such as neo of his. Once the fibroblasts have been 
efficiently infected, the fibroblasts are analyzed ro determine whether protein is - 
produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodei 3 microcarrier beads. 



Example 27: Method of Treatment Using Gene Therapy ■ In Vivo 

Another aspect of the present invention is, using in vivo gene therapy methods 
to treat disorders, diseases and conditions. The gene therapy method; relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) 
sequences into an animal to increase or decrease the expression of the polypeptide. 
The polynucleotide of the present invention may be operatively linked to a promoter 
or any other genetic elements necessary for the expression of the polypeptide by the 
target tissue. Such gene therapy and delivery techniques and methods* are known in 
the art, see, for example, WO90/1 1092, W098/M 779; U.S. Patent NO. 5693622, 
5705151, 5580859; Tabata H. et al. (1997) Cardiovasc. Res. 35(3):470-479, Chao J et 
al. (1997) PharmacdL Res. 35(6):5 1 7-522, Wolff J.A. (1997) Neuromuscul. Disord. 
'7(5):3 14-3 18,- Schwartz B. et al. (1996) Gene Ther. 3(5):405-41 U Tsurumi Y. et al. 
( 1 996) Circulation 94( 1 2):328 1 -3290 (incorporated herein by reference). 

The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial 
space of tissues (heart. musclcskin, lung, liver, intestine and the like). The 
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polynucleotide constructs can be delivered in a pharmaceutical ly acceptable liquid or 
aqueous carrier. 

The term "naked" polynucleotide, DNA or KNA. refers to sequences that are 
free from any delivery vehicle thai acts to assist: promote, or facilitate entry jnto the ■ 
cell, including viral sequences, viral particles, liposome formulauons, lipofectin or 
precipitating agents and the like. However, the polynucleotides of the present 
invention may also be delivered in liposome formulations (such as those taught in 
Feigner P L. et aL ( 1 995) Ann. NY Acad. Sch 772: i 26- 1 39 and Abdallah H. et al. 
(1995) Biol. Cell 85(1 ): 1-7) which can be prepared by methods well known to those 
skilled in the art. . 

The polynucleotide vector constructs used in the gene therapy t method are * 
preferably constructs that will not integrate into the host genome nor will they contain 
sequences that allow rbr replication. Any strong promoter known to those skilled in \ 
the art can be used for driving the expression of DNA. Unlike other gene therapies 
techniques, one major advantage of introducing naked nucleic acid sequences into 
target cells is the transitory nature of the polynucleotide synthesis in the cells. Studies 
have shown that non-replicating DNA sequences can be introduced ; into cells to ' ' 
provide production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of 
tissues within the an animal, including of muscle, skin, brain, lung, liver, spleen, bone \ 
marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, 
stomach, intestine, testis, ovary, uterus, rectum,, nervous system, eye, gland,- and 
connective tissue. Interstitial space of the tissues comprises the intercellular fluid, 
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers 
in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same 
matrix within connective tissue ensheathing muscle cells or in the lacunae of bone. It 
is similarly the space occupied by the plasma of the circulation and the lymph fluid of 
the lymphatic channels. Delivery to the interstitial space of muscle tissue is preferred 
for the reasons discussed below: They may be conveniently delivered by injection 
into the tissues comprising these cells. They are preferably delivered to and 
expressed in persistent, non-dividing cells which are differentiated, although delivery 
and expression may he achieved in non-differentiated or less completely 
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differentiated cells, such as. for example, stem cells of blood or skin fibroblasts. In 
vivo muscle cells are particularly competent in their ability to take up and express 
polynucleotides. . •' • 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RXA will be in the range of from about 0.05 g/kg body weight to about 50 mg/kg 
body weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 
mg/kg and more preferably from about 0.05 mg/kg to about 5, mg/kg. Of course, as 
, the artisan of ordinary skill will appreciate,' this dosage will vary according to the 
tissue site of injection. The appropriate and effective dosage of nucleic acid sequence 
can readily be determined by those of ordinary skill in the art and niay depend on the 
condition being treated and the route of administration. The preferred route of < 
■ administration is by the parenteral route of injection into the interstitial space of > 
tissues. However, other parenteral routes may also be used,, such as, inhalation of an 
aerosol formulation particularly for delivery to lungs or bronchial tissues, throat or ' 
mucous membranes of the nose. In addition, naked polynucleotide constructs can be 1 
delivered to arteries during angioplasty by the catheter used in the procedure.. 

The dose response effects of injected polynucleotide in muscle in vivo is 
determined as follows. Suitable template DNA for production of mRNA coding for * * 
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA r which may be either circular, or 
linear, is either used as naked DNA or complexed with liposomes, the quadriceps 
muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female, and male Balb/C mice are anesthetized by ■ .<< 
• intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1 .5 cm incision is made on 
the anterior thigh, and . the quadriceps muscle is directly, visualized. The template ' 
DNA is injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over 
one minute; approximately 0.5 cm from the distal insertion site of the muscle into the .. 
knee, arid about 0.2 cm deep A suture is placed over the injection site for future 
localization; and the skin is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 15 urn cross-section of the 
individual quadriceps .muscles is histochemically stained, for protein expression. A 
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time course for protein expression may be done in a similar fashion except that 
quadriceps from different mice are harvested at differeni times. Persistence of DNA 
in muscle following injection may be determined by Southern blot analysis after 
preparing total cellular DNA and HIRT supernatants from injected and control mice. 
The results of the above experimentation in mice can be use to extrapolate proper 
dosages and other treatment parameters in humans and other animals using naked 
DNA. 

Example 28: Transgenic Animals. 

, The polypeptides of the invention can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice* rats, rabbits, hamsters, 
guinea pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., 
baboons, monkeys, and chimpanzees may be used to generate transgenic animals. In a ' 
specific embodiment, techniques described herein or otherwise known in the art, are 
used to express polypeptides of the invention in humans, as part of a gene therapy 
protocol. 

Any technique known in the art may be used to introduce the transgene (i.e., 
polynucleotides of the invention) into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not limited to, pronuclear 
microinjection (Paterson et aL Appl. Microbiol. Biotechnol. 40:691-698 (1994): ■ 
Carver et aL, Biotechnology (NY) 1 1: 1263-1270 (1993); Wright ct a!., Biotechnology 
(NY) 9:830-834 (1991); and Hoppc ct al., U.S. Pat. No. 4,873,191 (1989)); retrovirus 
mediated gerie transfer into germ lines (Van der Putten et al, Proc. Natl. Acad. Sci., - 
USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic 
stem cells (Thompson et al., Cell 56:313-321 (1989)); clcctroporation of cells or 
embryos (Lo, 1983, Mol Cell. Biul. 3:1803-1814 (1983)); introduction of the 
polynucleotides of , the invention using a gene gun (see, e.g., Ulmer et al., Science 
259:1745 (1993); introducing nucleic acid constructs into embryonic pleuripotcnt 
stem cells and transferring the stem cells back into the blastocyst; and sperm- 
mediated gene transfer (Lavitrano et al:, Cell 57:717-723 (1989); etc. For a review of 
such techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol. 115:171-229 
(1989), which is incorporated by reference herein in its entirety. 
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, Any technique known in the art may be used to produce transgenic clones 
containing- polynucleotides of the invention, for example, nuclear transfer into 
enucleated oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to 
quiescence (Campell et ah. Nature 380:64-66 (1996); Wilmut et ah, Nature 385:8IOt 
813(1997)). 

The present invention provides for transgenic animals that carry the transgene 
in all their cells, as well as animals which carry the transgene in some, but not all thetr 
cells, i.e.. mosaic animals or chimeric. The transgene may be integrated as a single - 
transgene or as multiple copies such as in concatamers, e.g., head-to-head tandems or 
hcad-to-tail tandems. The transgene may also be selectively introduced into.and 
activated in a particular cell type by 1 following, for example, the teaching of Lasko et 
al. (Lasko et ah, Proc. Nath Acad. Sci: USA 89:6232-6236 (1992)). The regulatory 
sequences required for such* a cell-type specific activation will depend upon the 

t .s 

particular cell type of interest, and will be apparent to those of skill in the an. When 
it is desired that the polynucleotide transgene be integrated into the chromosomal site 
of the endogenous gene, gene targeting is preferred. Briefly, when such a technique is 
to be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene. The transgene may also be selectively 
introduced into a particular cell. type, thus inactivating the endogenous gene in only 
that cell type, by following, for example, the teaching of Gu et ah (Gu et ah, Science. 
265: 103- 1 06 ( 1 994)). The regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell type of interest, and will be apparent 
to those of skill in the art. 

Once transgenic animals have been generated, the expression of the 
recombinant gene may be assayed utilizing standard techniques. Initial screening 
may be accomplished by Southern blot analysis or PCR techniques to analyze animal 
tissues to verify that integration of the transgene has taken place. The level of mRNA 
expression of the transgene in the tissues of the transgenic animals may also he / 
assessed using techniques which include, but are not limited to, Northern blot analysis 
of tissue samples obtained from the animal, in situ hybridization analysis, and reverse 
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transcriplase-PCR (rt-PCR). Samples of transgenic gene-expressing tissue may also 
be evaluated immunocytochemically or imrriunohistochemically using antibodies 
specific for the transgene product . , ' . . 

Once the founder animals are produced, they may be bred, inbred, outbrcd, or 
* crossbred to produce colonies of the particular animal- Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more 
than one integration site in order to establish separate lines; inbreeding of separate 
lines in order to produce compound transgenics 4 that express the transgene at higher 
levels because of the effects of additive expression of each transgene; crossing of 
heterozygous transgenic animals to produce animals, homozygous for a given 
integration site in order to both augment expression. and eliminate the need for 
screening of animals by DNA analysis; crossing of separate homozygous lines to 
produce compound heterozygous or homozygous lines; and breeding- to place the 
transgene on a distinct background that is appropriate for an experimental model of 
interest. 

. Transgenic animals of the invention have uses which include, but are not 
limited to. animal model systems useful in elaborating the biological function of 
polypeptides of the present invention, studying conditions and/or disorders associated 
with aberrant expression, and in screening for compounds effective in ameliorating 
such conditions and/or disorders. 

Example 29: Kqp<;k-CM Animals. 

Endogenous gene expression can also be reduced by inactivating or "knocking 
out" the gene and/or its promoter using targeted homologous recombination. (E.g., 
see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503- 
512 (1987); Thompson etal., Cell 5:313-321 (1989); each of which is incorporated by 
reference herein in its entirety). For . example, a mutant, non-functional 
polynucleotide of the invention (or a completely unrelated DNA sequence) flanked by 
DNA homologous to the endogenous polynucleotide sequence (either the coding 
regions or regulatory regions of the gene) can be used, with or without a' selectable 
marker and/or a negative selectable marker, to transfect cells that express 
polypeptides of the invention in vivo. In another embodiment, techniques known in . 
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the art are used to generate knockouts in cells that contain, but do not express the gene 
of interest. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the targeted gene. Such approaches are particularly suited in 
research and agricultural fields where modifications to embryonic stem cells can be 
used to generate animal offspring with an inactive targeted gene {e.g., see Thomas & ■'" 
Capecchi 1987 and Thompson 1989, supra). However this approach can be routinely 
adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivn using appropriate viral vectors that 
will be apparent to those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered • 
to express the polypeptides of the invention, or alternatively, that are genetically 
engineered not to express the polypeptides of the invention (e.g., knockouts) are 
administered to a patient in vivo: Such cells may be obtained from the patient (i.e., 
animal, including human) or an MHC compatible donor and can include, but are not 
limited to fibroblasts, bone marrow cells, blood cells (ej>., lymphocytes), adipocytes, 
muscle cells, endothelial cells etc. The cells are genetically engineered in vitro using - 
recombinant DNA techniques to introduce the coding sequence of polypeptides of the 
invention into the cells, or alternatively, to disrupt the coding sequence and/or 
endogenous regulatory sequence associated with the polypeptides of the invention, . ; 
e^g., by transduction (using viral vectors, and preferably vectors that integrate the 
transgene into the cell genome) or transaction procedures, including, but not limited 
to, the use of plasmids, cosmids, YACs, naked DNA, electroporation, liposomes, etc. 
The coding sequence of the polypeptides of the invention can be placed under the 
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 
expression, and preferably secretion, of the polypeptides of the invention. The 
engineered cells which express and preferably secrete the polypeptides of the 
invention can be introduced into the patient systemically, e.g.. in the circulation, or 
intraperitoneal^, •" 

Alternatively, the cells can be incorporated into a matrix and implanted in the 
body, e^g., genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a lymphatic or 
vascular graft. (See, for example, Anderson et al. U.S. Patent No. 5,399,349; and 



WO 00/06698 PCT/US99/17130 

• ' 356 

Mulligan & Wilson, U.S. Patent No. 5,460,959 each of which is incorporated by 
reference herein in its entirety). 

When the cells to be administered are non-autologous or non-MHC 
compatible cells* they can be administered using well known techniques which 
prevent the development of a host immune response against the introduced cells. For 
example, the cells may be introduced in an encapsulated form which, while allowing 
for an exchange of components with the immediate extracellular environment, does 
not allow the introduced cells to be recognized by the host immune system. 

Transgenic and "knock-out* animals of the invention have uses which include, 
but are not limited to. animal model systems useful in elaborating the biological 
function of polypeptides of the present invention, studying conditions and/or disorders 
associated with aberrant expression, and in screening for compounds effective in 
ameliorating such conditions and/or disorders. 

it will be clear that the invention may be practiced otherwise than as ; 
particularly described in the foregoing description and examples. Numerous 
modifications and variations of the present invention are possible in light of the above 
teachings and, therefore, are within the scope of the appended claims. 

, The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples 
is hereby incorporated herein by reference. Further, the hard copy of the sequence 
listing submitted herewith and the corresponding computer readable form are both 
incorporated herein by reference in their entireties. 
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What fs Claimed Is: 

I . An isolated nucleic acid molecule comprising a polynucleotide having 
a nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment 
of the cDNA sequence included in ATCC Deposit No:Z. which is hybridizable to 
SEQ ID NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X; 

■(c) a polynucleotide encoding a. polypeptide domairi of SEQ ID NO:Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit' 
No:Z, which is hybridizable to SEQ ID NO:X; , 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO: Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit 
NoiZ, which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO: Y or the cDNA 
sequence included in ATCC Deposit Nb:Z, which is hybridizable to SEQ ID NO:X, 
having biological activity; . ' 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homoiogue of the SEQ ID 
NO:Y; , ■' *" .' ' 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 
one of the polynucleotides specified in (a) : (h), wherein said polynucleotide does not 
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

2. . The isolated nucleic acid molecule of claim I; wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted 
protein. ' 



WO 00/06698 



363 



PCT/US99/I7130 



3. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO: Y or the polypeptide encoded by. the cDNA sequence 
included in ATCC Deposit No:Z, which is hybfidizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim' 1, Wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X 
or the cDN A sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X. ; 

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide : 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. , 

6. The isolated, nucleic acid molecule of claim 3, wherein the nucleotide* 
sequence comprises sequential nucleotide deletions from either the" C-terrninus or the 
N-ierminus. . . . ■ ■ * 

A recombinant vector comprising the isolated nucleic acid molecule of 

A method of making a recombinant host cell comprising the isolated 
molecule of claim I. . * 

A recombinant host cell produced by the method of claim 8. 

The recombinant host cell of claim 9 comprising vector sequences. 

An isolated polypeptide comprising^ amino acid sequence at least > 
95% identical to a sequence selected from the group consisting of: L •..*** 



1. 

claim 1. 



8. 

nucleic acid 

. 9. 
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(a) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence 
included in A TCG Deposit No: 2; . * 

(b) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence • 
included in ATCC Deposit No:Z t having biological activity; 

, (c) a polypeptide domain of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; ' 

(d) a polypeptide" epitope of SEQ ID NO:Y or the encoded sequence included ' 
. in ATCC Deposit No:Z; . 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(0 a fall length protein of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No-Z; 

(g) a variant of SEQ ID NO: Y; 5 ■ 

(h) an allelic variant of SEQ ID NO:Y; or 

^; (i) a species homologue of the SEQ LD NO:Y. 

12. The isolated polypeptide of claim 1 1 , wherein the secreted form or the 
full length protein comprises sequential amino acid deletions from either the C- 
lerrninus or the N-rterrninus. 

13. An isolated antibody that binds specifically to the isolated polypeptide ;V 
of claim II. . ' " ' 

14. A recombinant host cell that expresses the isolated polypeptide of 
claim 11. 

15. A method of making an isolated polypeptide comprising: 

'. , (a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and , ' , 

« (b) recovering said polypeptide. " 



16. The polypeptide produced by claim 15. 
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17. A method for preventing, treating, or. ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount 
of the polypeptide of claim li or the polynucleotide of claim I. 

18. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition uri a subject comprising: 

(a) determining the presence or absence of a mutaLion in the polynucleotide of 
claim 1 ; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

,19. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: . , 

(a) determining the presence or amount of expression of the polypeptide of 
claim 1 1 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim 
1 1 comprising: , , L '\ 

(a) contacting the polypeptide of claim 1 1 with a binding partner; and 

(b) determining whether the binding partner effects an activity pf the 
polypeptide. > , 

21.. The gene corresponding to the cDNA sequence of SEQ ID NO: Y. 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

. (a) expressing SEQ ID NO:X in a cell; ' " 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and , 

(d) identifying the protein in the supernatant having the activity.' " 
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23. The product produced by the method of claim 20. ■ 
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<210_> 1 
<211> 733 
<212> DNA 
, <213> Homo sapiens 



<400> 1 • 

gggarccgga gcecaaaict cctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 

aattcgaggg tgcaccgtca gtcttcctct tcqccccaad acccaaggac accctcatga. 120 

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 

tcaagttcaa ctggtacgtg gacggcgtgg : aggtgcataa cgccaagaca aagccgcggg • 240 

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact ' 300 

ggctgaatgg caaggagtac aagtgcaagg. tctccaacaa agccctccca acccccatcg 1 360 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 

catcccgggd tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct - . 480 

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 

ccacgcctcc cqtgctggac tccgacggct cctr.cttcct ctacagcaag ctcaccgtgg 600 

.acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 

acaaccacta cocgcagaag agcctctccc ' Lytctccggg taaacgagcg cgacggccgc 720 ! 

gactctagag gat 733 



<210> 2 . 
<211> 5 ■ 
<212> PRT 
<213> Homo sapiens 

<220>, 1 

<221> Site ^ - 
<222> {3} 
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<223> Xaa equals any of the twenty naturally ocurring L-amir.6 acids 
' <400> 2 

Trp Ser Xaa Trp Ser ' 

. 1 . ' 5 ' \ ■ • ' 



<210> 3 

<21i> 8S , * ., 

<212> DNA . . ^ • ' , 

<213> Homo sapiens' 

<400> 3 ( . ' ' • • • • 

gcgccccgag arttccccga aatctagatt tccccgaaat gattcccccg aaacgatttc 

cccgaaatat ctgccatcbc aattag 



60 
86 



<210> 4 
<211> 27- 
<212> DNA 

<213> Homo sapiens * 
<400> 4 

gcggcaagct ttttgcaaag cctaggc 



27 



<210> 5 
' <211> 271 ■• 

<212> DNA ■ ' . ' ,.' 

<213> Homo sapiens * * 

<40O> 5 ., ' 

■ ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 
J aaatatctgc catctcaat t - agtcagcaac catagtcccg c'ccccaactc cgcccatccc 
gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 
'ttatgcagag gccgaggccg cctcggcctc t gage tat t c cagaaqtaqt 'gaggaggctt 
ttttggaggc ctaggctttt gcaaaaagct t ■ ' • ' 



60 
120 
.180 
240 
271 



<210> 6 t ■ • ' 

<211> 32 • ' ' .' 

<212> DNA 

<213> Homo sapiens 

<40O> 6 • . * ' , 

gegctcgagg gatgacagcg atagaacccc g'g ' ' 32 



<210> 7 '. , . 

<211> 31 ' 

<212> DMA : . 

<213> Homo sapiens 1 . ' ; ' 

<400> 7 ■ "'. A . , . ' ' 

gegaagctte gcgactcccc ggatccgcct c * , 31 



<210> 3 
<211> 12 
<212> DNA 
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<213>, Homo' sapiens 

<400> 8 ' ,- . " - 

ggggacttt'c cc . ^2 

<210> 9 • " ' • • • ' 

■' <:211> 73 • , ' 

<212> DNA . ' 

<213> Homo sapiens ■ 

<400> 9 ' „ 

gcggccccga ggggac-tfcc ccggggactc tccggggact ttccgggact ccccatcctg 60 
ccatctcaau tag ♦ " ■ ' 73 



<210> 10 . " . • : 

<211> 256 ' . . ■ • , ' • " •• 

<212> DNA . " ' ♦ 

<213> Homo sapiens ' ' * ' 

<40n> 10 ' ' 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60* 

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120 

.cagttccgcc cattctccgc cccatggctc actaatcttf tttatttatg cagaggccga . i80 

ggccgcctcg gcctcngagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240 

ctttrgcaaa aagctt 256 



<210> 11 
<211> 1564 
<212> DNA 

<213> Homo sapiens 



<400> 11 • ' ' * ... • 

gcggacggtg ggctgtgcaa ccttcctccc tttcttaaat gcttggggca \ t~ tgtctggc 60- 

cttccctttt actgctggct getgcctgca tctgtctctt aaccctcatt aactgtgcct 12 0 

■atgtcaaatg gggaaccctg gtacaagata t tt tcaccta , tgctaaagta ctggcactga '1B0 

tcgcggtcat cgctgcaggc attgttagac ttggccaggg agcctctact cattttgaga 240 

attcctrtga gggttcatca tctgcagtgg gtgacattgc cctggcactg tactcagctc 300 

tgttctccta ctcaggctgg gacaccctca actatgtcac tgaagagatc aagaatcctg . 360 

agaggaacct gcccctctcc attggcatct ccatgcccat tgtcaccatc atctatatc: 420- 

tgaccaatgt ggcctatrat actgtgctag acatgagaga catcccggcc ' agtgatgctg . 480 
ttgctgtgac^ ttttgcagat cagatatttg gaatatttaa -ctggataatt- ccactgccag , ' 540 

ctgcattatc-ctgttttggc ggcctcaacg cctccactgt ggctgcttct aggctt-tct 600 

ctgtgggctc aagagaaggc car.rr.rcccg,. atgccacctg catgatccat 'gttgagcggt 660 

ncacaccagt gccttctctg ctcttcaatg gtatcatggc atcgatctac tcgcgcgtgg ' 720 

aagacatcr.t ccagctcatt aautactaca gcttcagcta ccggctcttt -gtggggcttt 780 

ctattgtggg ccagctttat ccgcgctgga aggagcctga tcgacctcgt ccccccaagc R40 

tcagcgtttt cttcccgatt gtcttctgcc tctgcaccat cttcctiggtg gctgttccac .900 
tttacagtga tactatcaac tccctcatcg gc'attgccat tgccctctca ggcctgccct ' 950 

tttacttcct cancatcaga gtgccagaac ataagcgacc gcLttacctc cgaagatcgt 1020 
ggggtccgcc acaaggtacc tccaggtcct gtgcatgtca, gttgctgcag aaatggattt ' 1080 
ggaagatgga ggagagacgc ccaagcaacg ggatcccaag tctaactaaa paccatctgg ' 1140 

aatcctcacg tggaaagcag gggtttctgg tctactggct agagctaagg aagttgaaaa 1200 

ggaaagctca cttccttgga ggcacctgtc cagaagcccg gcctaggcng .cttcaacctt 1260 

tgaacttact ttttgaaatg aaaagtaatt tatttgtttt gctacatact gctccagact ■ 1320 

tttaaagggg acaatgaagg cgactgtggg gaggagcatg tcaggtztgg . gc'ttggttgt 1380 

tttagaagca cctgggtgtg cctacctact cctcttttct tttaaaaggg cccacaatgc 1440 
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tccaatctcc tgxctccttt agagagacat gaaactatca eaggr.gctgg atgacaataa 1S00 

aagtttargt tcctaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagggcggc 1S6Q 

CCCg " ( . . . 1564 

<210> 12 • . • • . ' 
<2H> 175? 

<212> DNA ■ 1 

<213> Homo sapiens , ■ '* ' ' 



<400> 12 . , *■ ' 

ggcacgaggt agatgggstc ttggtgtgga tgtcctttct gtttgttagt tgcccttcta 60 

acagacagga ccctcagctg caggtctgtt ggagtaccct gcaatgtgag gtgtcagtgt 120 

gcccctgctg gagggtgcct cccagttagg ctgctcgcgg gtcaggggtc agggacccac 180 

ttgaggaggc agtctgcccg ttczcagatc tccagctgca tgctgggaga acsactgctc ' 240 

tcttcaaagc tgtcagacag ggaeatttaa gtccgcagag gttactgctg tctttttgtt 300 

tgtctgtgcc ccgcccccag aggtggagcc tacagaggca ggcaggcctc ctcgagctgt 360 

ggtgggctcc acccagtLcg agcttcccgg ccgcctcgtc tacccaagca agcctgggca 420 

atggcgggcg cccctccccc agtctcgctg ccgcctcgca gtttgatctc agactgctgt *480 

gctagcaatc* agcgagactc cctgcggtag gaccctctga gccaggtggg ggarrataatc *540 

ttgtggtgcg ccattmtt'ta agcccgtcgg aaaagcgcag tattcaggtg ggagtgaccc • 600 

gattttcccg attttccagg tgccgtccgt catccctttc tctgattagg aaagggaact * 660 

ccctgacccc ttgcgcttcc cgagtgaggc aatgccttgc gccgctttgg ctcgtgcatg 720 

gtgcgtgcac ccactgacct gcgcccactg tctggcactc cctagtgaga tgaacccggt • 780 

acctcagatg gaaatgcaga aatcacccgt cttctgcgtt gcccaggctg ggagctgtag -840 

accagagctg ctcctattcc gccatcttgg ctcctccccc accccccgcc eaatcctaag 900 

tttatctctc aacaaagtaa atactgcaca cgtatttaca tgttttaaao tattgtttat 960 

ggagcactta ctatgtgttit tacggtccaa gtactttata cta^ccactg actctttata "* 1020 

gcaactctat aacatattaa tagattgtct caatttcact gatgacaaaa ctggcagctt 1080 

agtaactaat gagtaacaat accagcccct aaaacctagg gtctttccac tacagcacac 1140 

tctgactgac attgcr.cr.tg t tcagacaca gcactcetca cctagaaaag gatggagagc * 1200 

tagaaagtgg ttrtcrtttt caagttgttc ccattcatca atattctatc aagatatatg 1260 * 

, gaagagttat tttctagg.tc Ltcxagaagg ggctgc-gac acctgggaat tcaaaataat 1320 

tactccagag ggaaatgtgg gcatagaaac tttttaaaaa ctgaattttc ccgatctaat 1380 

atcggcttcc acatacagta tattatttct taaaaatcag ccagaaatgc acctgcagtt 1440 

.ttatctacrg gattttcctt tcccagcccc ' attcttc ttt aagtagtttt tcttccccat 1500 

tttataaaag aaaagcacac ctcccaccta tagcttccct ccttatggta tatcatctca 1560 

ggggataatt egaaatattg cLuqagacac aaccctcact: aagaaataaa ctttcaggct 1620 

gggcgcagtg gctcacactt gtaatcccag aactttggga ggccaaggcg ggggaggatc 1680 

cctgaaggcc aagagttcaa gaccagcccg gaatatatac caagatcccc tctccacaaa 1740 

aaaaaaaaaa aaaaaaa - t 1757 



' . . T 

<210> 13 

<211> 1373 * 
<212> DNA „ , 

<213> Homo sapier.s ' 



<400> 13 . ' ■ • 

gtgaaatgaa acatgtatac caaaacacat aaatttaagt ttcataatct* cctttccttg 60 

tggatctgga aaat-ttctc tccgctgctt ttcartccga tagttgcttt ggctttccct 120 

attccctgcc rttccatatt ttgaaatgtt ■ ttggtcaaau LLLttcctgt cgagtgtgca • 180 

gtatgtcttg accgttaatt. tcattgacca tcatcccagt ggattattna -ttgaaactt .240 
ctctgttccc cgttctggaa tgtttcctgc cccggaagcc arattttaac tgtatactga ' 300 

tccagcctta tztatcttaa gtatttttac aactatttta- ctgataatat acaactagg^ 360 

ataatcactc tagtcagtga aatcaaactg ataacacaaa atgatgctgc tag^aaatti 420 

agcctgaaaa ggatcagagg .aagcactttt Cagcctaagt agtcaagatc aataaactta 480 

gaatgagagt tttgtattat ctagccttgt gatagaag'ga aaaagtacag ccaccgaatc 540 

tgtatattaa tttazgcaaa acacacacat latctttcca ttaacacagt agtatgacat 600 
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acagttctat caccactgga cccatgcatg cttaaaatga aaatgaaagg aacacacagg 

aaagtagctt cttatcttcc cagatacagc ctccaagagt gccnctgcrg tgtactggag 

tgagtttgaa gaagttcagg taaccctctc ctaaccagca gctcaactta aagtgcatct: 

tcattcttgL cctaccacta aaagcttgct tattgacctc cctccttccc' gcctcctaaa 

ttgtgactgg c:ctcagtta' gacctgcatc tgttgccart tgtagcagca attgcaggat 

tcatctgctt: tttc-cttgc agtgtatttc tgaatgtt-t ataactctgg gtatgcataa 

tgtacgcata tgcatatcca tatggtcaag agttttcaga actctttgtt tgaataatga 

acgtgtgtat tccctacccc ttgtLLtigct ctgtcttaac attcaagctt tgcttattaa 

aatgtatttg CLLCaatgct ctctcrtttag attgtaagct cttcatgggc agtggtcaga 

zat^ctgtat ttgtaact-c ccgagaatct agtattttat catgtacata atgaacatag 

aatagtattg attggaagga cagcgaagta caaaatgaga ctctaaaatg acattttcat 

tagggctcac aagtgtataa gaag^atccc tttcctattt ttttgtttaa cactttagaa 

gctattacat taaaaaagct a^tctcacta aaaaaaaaaa aaaaaaaaaa aaa 



<210> 14 
<211> 3740 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (957) 

<223> n equals a,t,g, or c 



<400> 14 

acrtctctaag gcfctttttgg actcaccaaa caggcttctt gcagtggaya tgaatacaga 60 

tcacttaagg tngactgtgc caaatggcat aggggccccg aagctaaggg aaatggaaca 120 

ccacttccca cagggcctgt cagttcagct gcttaatgat gggtccaagg gcaaactcaa 180 

tcattcatgt ggagctgact'.ttgtgaaaag tcatcagaaa cctccacagg gaatggaaat 240 

taagtccaat gaaagatgct gttcttttga tggagargca gacagaattg tttaetacta 300 

ccatgatgca gacggccact ttcatctcat agatggagac aagatagcaa cgctaattag 360 

cagtttcctt aaagagctcc tggtggagac tggagaaagc ttgaatattg gtgttgtaca 420 

aactgcacat gcaaatggaa gttcaacacg gtatcttgaa gaagttatga aggtacctgt 480 

ctattgcact aagactggtg taaaacattt gcaccacaag gctcaagagt rtgacattgg 540 

agtccatctt gaagcaaatg ggcacggcac tgcactgttt * agtacagctg ctgaaacgaa 600 

gataaaacaa tcagcagaac aactggaaga taagaaaaga aaagctgcta agatgcttga 660 

aaacat^att gacttgtt ta accaggcagc tggtgatgct acctctgaca tgctggtgat 720 
tgaagcaatc ttggctctga agggcttgac tgtacaacag tgggatgctc tctatacaga . 780 

- tcttccaaac agacaactta aagctcaggt tgcagacagg agagttaf.ta gcactaccra 840 

tgctgaaaga caagcagtta cacccccagg attacaggag gcaatcaaeg acctggtgaa 900 

gaagtacaag ctttctcgag cttttgtccg gccccctggt acagaagatg tcguccngag 960 

tatatgcaga agcagactca caayaaagtg cagatcacct tgcacatgaa gtgagctxgg 1020 

cagtattcca gctggctgga ggaatcggag aaaggcccca accaggtttc tgaaganaat 1090 

tttcatattc ctgagaaacc ggacttttta caagtctcca caaaactgtc aataataatg 1140 

gcagtactaa gagarttata atcataatgt ttacaatgca gcctactgga ctgtctctag . 1200 

atctgttttt cttaaacact aacagaataa ttcttrataa ataggtaagc cttacacttg 1260 

ttaaagaaat ttaccuctaa ccccagcctc ac'aacgtaa aatactggga cttaagtata 1320 

caattcagcc actaactgta cagttttatg tggggaacaa ttcatgcagg ccactggaaa 1380 

attaaatxtt artaccaact ccttgtgata tctttgccat caccatcaca tgagcaagat 1440 

gatgttrtgc agcattcccc attgctgaca caaatggaga gggcagagaa gactctatac 1500 

aaccag.tt;. tccattgcag agtcttaaga aagaLtatta gaegactcac cratatgact 1560 

aatgccatca ggaactcaga ggtazgaata gggggttgtc catccctctt ccatactgag 1620 

gtggagatgc tcatgcaata cctccaagga tgcatggtcc agccttcagt ta-tcttcac 1680 

cgctcttggt gaaggtatgt gggagaaaaa ctaattataa tacgtttccc agcctctgat 1740 

ggagaaggaa caccattctg ataccagaac atggttaata aggaoaagag aadddtcccc 1800 

aaccaatctt aa:tgaacca agtctgaaac caatggaaaa aaaaaatggg tagtgtatat 1860 

tttgcaggtt taagacaaci caggacaata aaaacaaigg actttacatg tgtatatata 1920 

tagcccrc-t aggcaccata atcagtatga gccaacaata ttcaaacttg attcaggcca 1980 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1373 
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cattcagaca tttgctctita catacaaata tttaaattaa atacaatctg ?, aaatgtgttc. 2040 

tgtt'acatac aaaaaaggaa aaactataca acgcagagca gtgtgtgtgt tttaaataat 2100 

tacatttaca tgtaagctaa atggaaccag caatggtgct caagttttta tcatcccttc 2160 

cagaaaaLct tttcccacca tctctcctat ttcctgccrg gctttgctgg aacatggttt 2220 

gtggttctcc agtttcatgt ccttattagg gaaggcattt gagtagagga taggactccc 2280 

tgagtgtcct ccaca'tcggc ttgtgacttt gctgttgaag acttgactga gcacatcgaa s 2340 

, gaacggcagg agctgcccca tactgcgcac ggtgcagatg gtgagcagca agtgccctgg - - 2400 

ctcccaaccc aacgttctcc ctgagtLgtc ttcccctgga tttttctgca gaaaacaaaa 2460 

agtgaactgg tattaataca acagacaatg tggtatgtta gaaaaattaa aaatatataa ,2520 

actttggcaa ttggtcaaga aatgaataca aatgacatta agtttctaac tcctgacccg 2580 

atcaaaaccc ttggtgcttc tgagaccttt tactgccact v tattagtctt acatggagca 2640 

gtctaacatt gtagtaatag ttcccaacta gaatgcgcag ataagcttag titaacagaaa 2700 

tagctttgaa caggaacaga gtcaaacata aaagttttat gttgtgcttt gtatttactc 2760 

aaaaagcccc caggtttctg aaccctcact actgtaacca aggactaggt cacaaaatta 2820 

ctacagaaaa aaggaacaaa 'gtgctttata catttcataa tatatcccct tttattataa 2880 

ttagttaatt cccrtttatc taaatggcct aaatttgcca tgatggtagc agtgtccana 2940 

.gtgaataatt actgtcagta ctgcatcaca gagaaaggaa gggatccctc aggagacact 3000 

g'ctgtctcct -ctgggttgt gctaaacaac atagggagga aagctggacc tggagtcaaa 3060 

ggaattgagt cagtgtgctg gctctgccat acttacggca* cccttgggca ggatatacaa * 3120 

aggttcctca cttataaaat gggacagtct aaaactacct tttagtagag aagtcaaatg ' 3180 

agaaggtatg tgaaaactct gtcaactaaa tataaagact aataatctgg gtattaagag 3240 

gctagtttga gaagccacct' gaattacaca aacacagcta cagacatcat . tctgtctaga 3300 

gaaagataag agagaacagg' ttggttgaac ttgggcagaa tcacagatac aattccacac 3360 

taaagaacga aaataagcaa. tgaactagac agaaggaaga aatcatgaag acttaggaag 3420 

cagaattaca atctgtcata ttaacaaatg gagtttgcct tctaagatca gatgttgctc 3480 

agaaactttc attgtttacc taataattta atatcactag tttcctagtg ggtcaagcag - 3540 

atgcaaaatc cagcttattt tcttctacgt gctctcaagc ttattgctta ttttaaagta 3 600 

aaatcctgaa aaaggaaaat attaggttgg cgcaaacgta attgcggttt Ctgcattgtt 3660 

gaaatttgcc gttttataet. ggagtacatt cttaaacaaa tgtggttatg ttatacaaaa 3720 

aaaaaaaaaa aaaactcgag - ' 3740 

<210> 15 

<211> 1196 ( 
<212> DNA ' 
. <213> Homo sapiens ' . 

<400> 15 ; * ' ; ■ 

ccacgcgtcc ggcgaatctg gatgatctat ttatgtcazt tatctatgtg tctcttctta 
tgccagtacc ccactgtcct gattattgtg gctttatggt gtcttgaaat taagtattgt 
taagctttgc aaccttctcc aagattgttt tggctatagg tcctttcttt acaaagttta 
tattaagctt ctccattttt ctctaagaaa aaatctgctg gaatggcact gaatttttag 
gccgatttgg gaagaaccga caaca^tgaa cctztcaazc gatggacatg gcatgtttct 
gcatttattt gggccttcta aaatttatct cagcaatatt ttgtagcttt agtgaagaag 
tcttgtatat atcttttgtt aaatgtattc ctaaatattt cgtggaaatg ttactgtaaa 
tagtacttta act tear. r. r.r. taagtttatt gctggta-ac agaaatatgt tttatttttg 
.taatattgac ttttatar ~c % tgtaatctta tttaaattca ■ cttagttcag tagtagttct 
tatttttatt ttattgeaag tgccttaggg' ataccttgtg tacacaatcc atgtcatata 
tgaataacca acaggtttac cttttgtttc ttcttgcttt gtcgtaatgg ctaggacctt 
ccaacctaaa tggttgaata gaagcggtgt gtgagttcat tactcttgtc ttattctcaa 
cagcagagga tagegtctag tttctcacta ttaaaatatg ttagctatag gttttttccg 
tttcatget- ttatcagatt gaagatccct tttattccLa ttttgccaag agtgtcatga, 
atgcatgttg aacttcacca agtacttntt cctgcatctg ttgagagaat cttgacttti 
ctcttctatt ttgttaatgt agattctcta atgttaaact gatcttgeat tcctgatat.a 
aaccatactt agtcatgata tgttatcccc tttatacaat gctggattcg gtctgcaaat 
attctgtttg atttttgeat etatgetcat gagacagatt ggcctgtgot tttccttLuc 
tgtaagtctt gtcaggtgcc agcgcttggt taattttgac acacttacaa acacatatt'" 
ttaagataaa aa:aaacttc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 



60 
' 120 
180 
240 
. 300 
'360 
420 
480 
,54 0 
' 600 
660 
.720 
v 780 
, • 840 
900 
960 
1020 
1080 
1140 
1196 
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<210> 16 

<211> 2209 

<212> DNA i 

<213> Homo sapiens 



<400>. 16 
gagctgcgcc 
ggccgaggcg 
gggttctgct 
. tgccacgaga 
aaccctgtta 
aaagaatagt 
<agaaaaatat 
cgacatcgtt 
gcatcggttt 
gctgcccaga 
ttggaacctt 
tggccttaea 
gctcatctat 
ccctgatgtc 
ttgtcgttcc 

. gtragaggaa 
tgttcaagct 
aaggctgtga 
tccaacatcc 
ccatgtcact 
gtttcgtgtg 
aacacaaaat 
csataagytg 
gtgacatcci: 
aggacaggct 
agccctgcaa 

- acttcggctc 
cggtgaccct 
cggtgtcctr. 
aaagcaaggt 
tgtcctggaa 
ggagcggcgt 
gggctgagga 
acagcttctg 
tcttctcggt. 
cgggaggtgg 
aaaadarama 



gggcctgggc 

tcccggagtc 

gctgggcgcg 

aagcaacett 

catcgtcatt 

ctttacctct 

tgactgtatg 

gttgcctacc 

agagctgcag 

cggagtcact 

ctgcagcaat 

cctcccatgg 

aaaacgcctg 

tgccaactac 

tgcacacctg 

ggaggagcgg 

ggaggacaag 

ccaagatgcc 

acaaaatgaa 

caccatcgag 

Cctagaatct 

ctccttcctt 

cacagaccac 

vcamctgcct 

cagcctggtg 

caccagcttc " 

cttccgcccg 

tcgcacccct 

tataccttat 

ctacccgagg 

catcagcgtg 

ggtctgccag 

gate tt cage 

ggtcaacatc 

gacacttacc 

agtcttactg 

aacaaggggc 



gcctggggcc 
acggccggcc 
gcgcgcctgc 
acagttctca 
tctaaaagac 
agetgecaga 
tcaggcccac 
ctcaacagaa 
ttttccatcc 
cactccatca 
ggcactgcgt 
ttccacccca 
tgeatcateg 
ccagaaggct 
cgggccagcg 
gctgaatact 
cagcctggga 
caaagtccag 
agcaataaaa 
ccacggcccg 
eggacbtgea 
tgtgacgatc 
cggtactgcc ' 
gtggagcr.gr 
ctggtgccag 
agctacctcg 
ggaggctcta 
gcccccagct 
ttcaaagagg 
acccccaacc 
cccagagacc 
acagggegeg 
ctggacgagg 
tytaaytgma 
ccaaggactg 
c-gtctgccc 
cccgctgtgg 



gccgctcccc 
cgaaccgcgg 
cgcgcggggc 
taaagctggg 
atataaccat 
gtcccgagaa 
gtccttttgg 
ctttcatctg 
ctcgcctgag 
gcggccgaat 
cccggatcaa 
,gaaatgtct= 
aqtctgtgtt 
tccctgagga 
tctccttcct 
acatcccggg 
acatggcggg 
ggatcctccg 
Lctacgtggt 
tcaaacagag 
gtagcaacct 
tgacaegtet 
aaaggaaatc 
atgaettcte 
cccagaagct 
tggccagtgc 
tcaagcagat 
tccgacaaga 
aaggcgtttc 
gggaccgggg 
aggtggcctg 
cattcatgat 
atgtgctccc 
gcccca'cgag 
tggacttgac " 
tegggctcat ■ 
gtatctacaa 



accgtcgttt 
ggtctcratc 
agaagctttt 
gaccccgacz 
' gttgtccatc 
tcactttgtc 
ggaggttcag 
ggatgtcaaa 
geagateggt 
cgatgccacc 
gatgeawgaa 
cggcztcagc 
tgagggcgaa 
tgagctcatg 
caacttcaac 
ccccaccacc 
gaacttcaac 
getgeagtte 
tgacttgagt 
ccgcaagttt 
caccctgaca 
gtggatgaat 
ctactcacty 
ctggaagctg 
gcagcagcat 1 
cataccca.gc 
ccaggtgaag 
ggcctccagg 
cacggtgacc' 
cc tgeca tec 
cccgactttc 
catccaggag 
caagccaagc 
eggcaagcag 
tgtcatcctc 
catttgetge 
tggcaacat 



Lccccaccga 
• geactgetag 
qagattgetc 
ctgctggcaa 
aagtctggag * 
atagagatcc 
cttcagccct 
gctcataaga 
ceggg-gaga 
gtggtcagga 
ggagtgaaaa 
attgeaaace 
ggcr.cagcaa , 
acgtggcagt 
ctctccaact 
aaccccgagg 
ctctctctgc 
caagttttgg 
aatgagegag 
gtccctggct 
tctggctcca 
gtggaaaaam 
caggtgccca 
ctggtgccca 
aedcacgaga , 
■caggacctgt 
cagaacatct 
cagggtctga 
cctgacacaa 
cccacctctg 
tttaaggagc 
cagcggaccc V* ' 
ttccaccatc 
ctagacctgc 
ategcagegg - 
gtgaaaaaaa 



60 
' 120 
180 

; 240 

'300 
360 
420 
480. 
'540 
600 
560 
720 
780 
. 840 
900 
960 
1020. 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 , 
1620 " : 
1680 
1*740 
1800 
1860 
1920 
1980 . 
2040 ■ 
.2100 , 
2160 
2209 



<210> L7 

<211> 1774 

<212> DNA 

<213> Homo sapiens 



<400> 17 > 
taggcacccc, 
•gacaccaatt 
ccctcactaa 
tggatccccc 
gaggttccca 
cccag^ccgc 
ggtcggtgtt 



aggctcttac 
tcacacaagg 
agggaacaaa 
gggctgeagg 
geaggatgee 
ceggggcagg 
catgatcctg 



actttatgtt 
aaccagctta 
agctggagct 
aattcggca'c 
ccggctctgc 
azgaccaagg 
ctgatcatcg 



ccggttcgta 
tgaccatgac 
ccaccgcggt 
gagcggccgc 
aggaagctga 
cccggctgtt 
tgtactggga 



tgttgtgtga 
tacgccaagc 
ggcggccgct 
gggegegagg 
agtgagaggc 
ccggctgtgg 
caacgcaggc 



attgtaaccg 

tcgaaattaa 

ctagaactag 

tgaggggege 

ccggagaggg 

ctggtgctgg ■ 

gccgcgcact 



60 
'120 
180 
240 
300 
'360 
420 



WO 00/06698 



PCT/US99/17130 



8 



tctacttgca cacgtccttc tctaggccgc acacggggcc gccgczqccc acgcccgggc 480 

cggacaggga cagggagctc acggccgact ccgatgtcga cgagtutctg gacaagtttc 540 

t tcagtgctgg cgtgaagcag agtgaccttc ccagaaagga gacggagcag ccgcctgcgc 600 

cggggagcat ggaggagaac gtgagaggct acgactggtc cccgcgcgac gcccggcgca " 660 
gcccagacca gggccggcag caggcggagc ggaggagcgt gctgcggggc ttctgcgcca . 720 

actccagcct gaccttcccc accaaggagc gcgcattcga cgarafcccrrr: aac tcggagc 780 

tgagccac'ct gatcgtggac gaccggcacg gggccatcta ctgctacgtg cccaaggtgg 840 

cctgcaccaa etggaagcgc gtgatgaLcg Lyetgagcgg aagcctgctg caccgcggtg • y00 

cgccctaccg cgacccgctg cgcatcccgc gcgagcacgt gcacaacgcc agcgcgcacc 960 

tgaccttcaa caagttctgg cgccgctacg ggaagctctc ccgccacctc atgaaggtca * 1020 

agctcaagaa gtacaccaag ttcctcttcg tgcgcgaccc cttcgtgcgc ccgatctccg 1080 

- ccctccgcag caagttcgag ccggagaacg aggagttcca ccgcaagttc gccgtgccca 1140 

tgctgcggct gtacgccaac cdcaccagcc tgcccgcctc ggcgcgcgag gcctcccgcg' 1200 

ctggcctcaa ggtgtccttc gccaacttca, tccagtacct gctggacccg cacacggaga 1260 

agctggcgcc cttcaacgag cactggcggc aggtgtaccg cctctgccac ccgtgccaga 1320 

tcgactacga cttcgtgggg aagrtggaga ctctggacga ggacgccgcg . cagctgctgc 1380 

agctactcca ggtggaccgg cagctccgct tccccccgag ctaccggaac aggaccgcca 1 1440 

gcagctggga ggaggactgg ttcgccaaga tccccctggc ctggaggcag cagctgtata 1500 

aactctacga ggccgacttt gttctcttcg gcraccccaa gcccgaaaac ctcctccgag 1560. 

actgaaagcr ttcgcgttgc ctrttctcgc gtgcctggaa cctgacgcac gcgcacf.cca ' 1620 

gtttttttar; gacctacgat cttgcaatct gggcttcttg ttcactccac cgcptctatc 1680 

catcgag^ac tgtatcgata ttgtttttta agattaacat 'att tcaggca tttaatacga 1740 

aaaaaaaaa'a aaaaaaac;c yagggggggc ccgg 1774 



<210> 18 

<211> 1674 * V 

<212> DNA , 

<213> !i Homo sapiens-. 

<220> ' - ' : 

<221> SITE ' ' 

. <222> (1649) 

<223> n equals a.t.g, or c 

<220> * 

<221> SITE 

<222> <1663) 

<;223> n equals a,t,g ( or c ' 

<220> ■ , ' 

<221> SITE 

<222> (1665) , 

<223> n equals a,t, g, or c 



<40.0> 18 , f . 

caagttggza cgcctgcagg taccggtccg gaattcccgg gtcgacccac gcgtccgg'tc t SO 

gaagataggt tcgagacggt ggatgttgca gctgaccatg cagttgggc'c cggcgccgct " 120 

cacacgccgc cccttttggg gctgcttcag ccagctcatg ctgtacgccg agagggctga 180 

ggcacgccgg aagcccgaca tcccagtgcc ttacctgtat ttcgacatgg gggcagccgt 240. , 

gctgtgcgct agtttcatgt ccttcggcgt gaagcggcgc tggttcgcgc tgggggccgc 300 

actccaactg gccattagca cctacgccgc ctacatcggg ggc;:au-yLec actacgggga 360 

ctggctgaag gtccgtatgt acccgcgcac agttgccatc atcggcggct ttcttgtgtt 420_ 

ggccagcggt gctggggagc tgtaccgccg gaaacctcgc agccgctcco tgcagtccac 480 

ccgccaggtg ttccdgggta tctacctca't ctgtgtggcc tactcactgc agcacagcaa 540 

ggaggaccgg ctggcgtacc tgaaccatct cccaggaggg gagcngotgn cccagctgtt 600 

cttcgtgctg catggcatcc tggccccggc ctttctgtca ggctactacg tgaccctcgc ' 660 

:;gcccagatc ctggcngtac tgctgccccc tgtcatgctg ctcattgatg gcaatgttgc 720 

ntactggcac aacacgcggc gtgttgagtt ctggaaccag atgaagctcc r.tggagagag 780 , 
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tgtgggcatc ttcggaactg ccgtcatcct ggccactgat ggctgagttt. tatggcaaga . 940 ', 

ggctgagatg ggcacaggga gccactgagg gtc'accctgc cttcctcctt gctggcccag 900 

. ctgctgttta tctatgcttt tcggtctgtt tgtttgatct ttcgcttctt taaaattgtt 960 

ttttgcagtt aagaggcagc tcaLt.tg^cc aaattcctgg gctcagcgct tgggagggca 1020 

ggagccctgg cactaatgct gtacaggttt ttttcctgcc aggagagc.g aggccagctg X080 

cccactgagt ctcctgtccc tgagaagg'ga gtatggcagg gctgggacgc ggctartgag 1140 
agtgggagag tgggagacag aggaaggaag atggagat-g gaagtgagca aatgtgaaaa ; 1200 

attcctcttr gaaccnggca gatgcagcta ggctctgcag tgctgtttgg agactgtgag 1260 

• agggegtgtg tgtgttgaca catgtggatc aggcccagga agggcacagg ggccgagcac 1320 ' 
tacagaagtc acatgggttcr tcagggtatg ccaggggcag aaacagtacc ggctctctgt, ' 1380 

cactcacctt gagagtagacr cagaccctgt tctgctctgg gctgtgaagg gctggagcag 1440 

gcagtggcca gctttgccct trcrgctgtc tctgtttcta' gctccatggt tggcctggtg 1500 
ggggtggagc tccctcccaa acaccagacc acacagtcct ccaaaaataa acattttaca * 1560 

cagamaaaaa aaaaaaaaaa. aagggcggcc gccccagagg atccctcgag gggcccaagc 1620 

ctacgcgtgc atgcgacgtc a-agctctnt ccctatagaa gtngnaaagg gttc , "' 1674 



<210> 19 ' , 

<211> 2018 

<212> DNA 

<213> Homo sapiens 

<220> 

,<221> SITE 
<222> (20101 

<223> n equals a.t.g. or c 
<220> 

<221> SITE 
<222> (2012) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE , 

<222> (2014) , 

<223> n 'equals a,t,g, or c 

<220> . : , ' • ' 

<221> SITE 
<222> (2016) 

<223> n equals a,c t g, or c 



<400> 19 . * , 

cagcaaLgaa ,'atcctgccct ctcttcctca gaactacrat attcagtggc taaatggctc ■ 60 1 

cctgattcat ggtttgtgga atctxgcttc ccttttttcc aacctttgyt tatt-gtatt, 120 

gacgcccttt gcctttttct ttctggaatc 'agaaggctr.r. gnrggcctga aaaagggaac 180 

ccgagcccgc atttcagaga ct-tggtcat gcttcttcrt cttgcgitac ' tcat'tcttgg 240 

gatagcgtgg gtagcttcag cacccactga uaacgalgcc gcaagcatgg ■ aatcttcara 300 
cgauctccgg gagttccatc taccccattt atattcctgt atatcattga tgggatgttu > 360 

grtacttctc ttgtgcacac cagttggcct tcctcgtatg ttcacagtga tgggtcactt 420 

gctagtgaag ccaacaatr.r: ttgaagacct ggatgaacaa actnatatca- ttaccctaga. 480 

ggaagaagca ctccagagac gactaaatgg gctgtcttca tcggtggaat acaacaLaat 540 

ggag*t.tggaa caagaacntg aaaatgtaaa gactcttaac acaaaatnag agaggcgaaa 600 

aaaggcttca gcatgggaaa gaaatttggt gtatcccgct gttatggttc; tccttcttat 660 

tgagacatccTatctcggtcc tcttggtggc ttgtaatatt ctttgcctat, .tgg'ttgatga 720 

aacagcaacg ccaaaaggaa caagggggcc tggaatagga aatgcctctc ttt=ta=gtt 780 - 

tggctttgtg ggagctgcgc ttgaaatcat tt^ga^tccc tatcttatgg cgtcctctgc 840 . 

tgtcggcttc' tatagccttc gaLCLCCcgg aaaccctact cccaagaaag atgacacaac 900 ' 

tatgacaaag atcattggaa at tgtgtgtc , catcttiggtt ttgagctctg ctccgcctgc 960 
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gatgtcgaga acactgggaa tcactagatt tgatccacct ggcgactttg gaaggtttaa 
itggctggga .aatttctata ttgtattatc ctacaatttg' ctttttgcta ttgtgacaac 
attgtgtctg gtccgaaaat tcacctctgc agttcgagaa gaacttttca aggccctagg 
gctrcataaa cttcacttac caaacacttc aagggattca gaaacagcca agccttctgt 
aaatgggcat cagaaagcac cgtgagac'gc acagacggcg tcttccgcca ccaagagacc 
cgagaactcc agattcacga cattcctgtc ccatgcagaa gcatttccat tcaaccgtgg 
cccctcttca gaacctagac ctatcagtgc catttctttt tcataatcta cgaagaactt 
ggctatggct gatctttttt aaat-taact ttctgatgga ccctgtagtt: nccagttaag 
tgcagattcc ttacagacat a:agaacdgc gcattcutct gtagacattt gctcatgttg 
gtaaacacaa tcacccatat gaaaaaattg ttttcacctg atatgaaaat gtcagaaaag 
gcaaactccg ggacttczaa agatttacct .aaatcccatt atgtacttta ttcagaatgt 
agaagctgac rtgaaaggca tccttggtac taagtgaagc ttattcagaa aatgcatttt 
tcaaacgcaa tggcaactgc ttgtagatat catttttgca ' gtgtatgttg gagctgtaat 
ggttgcaatt atgtctctta tttccttaaa agcaaaaagc gtagtttctg atttatgtta 
tagaacgata ctgattagac tttgagccaa ggggaaaata ctaaattctt. ttaaacctgg 
agccttagag agccacagga atatcttctg *ttgtacagtc taataagctg tggtaggaag 
tatcatgtaa tcacagttta atgacagttt atgtatatat ataattcagt attccctcga 
gggggggccc ggtacccaat tcgcccaagn tnancnag 



<210> 20 

<211> 2098 

<212> DMA 

<213> Homo sapiens 



<40O> 20 

ggcacgaggg accgagcr.af. trctcctggga ctggctatga tggtgtgctc catcatgatg ■ 60 

tattttctgc tgggaaccac actcctgcgc tcatacatgc agagcgtgtg gaccgaagag 120 

'tc-caatgca ccctgctgaa tgcgtccatc acggaaacat ttaattgctc cttcagctgt 180 

ggtccagact gctggaaact ttctcagtac ccctgcctcc aggtgtacgt taacctgact 240 

tcttccgggg aaaagctcct cctctaccac acagaagaga -caataaaaat caatcagaag 300 

tgctcctata tacctaaatg tggaaaaaac cttgaagaat .ccatgtccct ggtgaatgtt 360 

gtcatggaaa acttcaggaa gtatcaacac ttctcctgct attctgaccc agaaggaaac 420 

cagaagagtg ,ttatcccaac aaaactctac agttccaacg. tgctgttcca ttcactcttc 480 

tggccaacct gtatgatggc rgggggtgtg gcaattgttg ccatggcgaa acttaicacag 540 

taccnctccc tactatgtga gaggatccaa cggatcaata gataaatgca aaaatggata 600 

aaataatttt tgttaaagcc caaatactgt tttctttcat tcttcaccaa agaaccttaa 660 

gtttgtaacg tgcagtctgt tatgagttcc ctaatatatc cttatatgta gagcaataat 720 

gcaaaagctg ttccatatgc aaacatgatg tctttattat tcaggagaac aaataactgt 780 

c-cgtgrtgg ttggtggttt tcataatctt atttctgtac tggaactagt actttcttct 840 

c^cattccgc caaaacaggg ctcagttatt catttgccaa gcttcgtgga ggaacgtagg - 900 

tgacatcaat gtgataaagt . ctgtgttctg agttgtcaga tctcttgaag acaacatttt* 960 

tcatcactta ttgtttacta aagctacagc caaaaatatt tttttttctt attctaaact 1020 
gagccctata gcaagtgaag ggaccagact tcctaattaa aggaagttag gtacttttct , 1080 

tgtatttttt accatatcac tgtaaagaag aggggaaacc cagccagcta ctttttttca 1140 
tcacttttta ttcataactt cagatttgta aaactaattt ccaaaatata agctgttttc ' 1200 

attagcbagt tctataatat cttcctgtga tttatgtaga aaatgaacac accccttttc 1260 

catttaagac cctgctactg tgtgaagaga tgatacttac aaggagtgtc aitacctgtg 1320 

agctgactga "atgttggrag gtgctccatt acaatccagg aaagtctgtg ttactgatat 1380 

ttgtgtggaa atctttattt cacttcaatt taaccattag atggtaaaat taagatgcta 1440 

ctcgttggta aaaattggtg gactggtttc aatgggtaaa r.gtgttgtgg caaattaatg 1500 

tgr.t.ggaata ttgctctttg tgaatttgtg cctaagtcaa tgaatgtgta gtatctcctt, 1560 

ctgacaagca ttccctatcg ggattttaaa gctatg-gca cagaatatta gtctctncta 1620 

catgttctat ttttctattt ataartccct cttttgttgt tatattctat acacagaaca 1680 
gatctttttt ctaacacata tttgaactga ataacagact taaagaaagc ctttgttcac . 1740 

attgctattt acttttgtgt tt-gggggaaa atacgaggga ttgattttaa ataaaaaaca 1800. 

ttccatcttt , catttaatat caatatcaaa agaagaagac aaacacctat cntrcncatc 1B60- 

tatatttaag taccttt-tg caatgtagca tcaaagtttc ttaggraatg caaaat~tta 1920 

caaattattt .gtggaatgaa tggtaaaact aatctga-ga aatggaaaat tatzctgcaa 19B0 
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tattgtaatccatagttiiga cttttcataa' gcaaa-aaat ccctaggatg taatcaggac 2040 
ttcaaacgtg taattaaatt tttttaaaaa aaatctaaaa aaaaaaaaaa aaaaaaaa 2098 



<210>' 21 * ' V •• ; ■ 

<211> 1746 '' 

<212> DNA ■* ^ '* 

<213> Homo sapiens > 

.<400> 21 . . ' 

gqgtccgcgc acctccagct cgggccgatg tggaagcttt ggagagctga agagggcgcg 60 

gcggcgctcg gcggcgcgct cctcctgetg ctcttcgcgc taggggtccg ccagctgctg 120 

aagcagaggc ggccgacggg cttccccccg gggccgccgg ggctgccatt tatcggcaac . 180 

acctattccc tggcagcctc atccgagctt . ccccatgtct acatgagaaa gcagagccag 240 

gtgtacggay aggtacagcc ccgacgggcc ccgggcaggg agggccgcca ggctggcccg 300 

ggctggccag ggccttcctg gttggactta tggccgcccc tgggccgact ageegggacc 360 

tctccgtgtg ccggctgccc tttgagggac acccgcttcc cgggtctgga agggagaagt 420 

cctcgacgcc gtgccccctt gcagggggag ccccgcccct gccggtgacc cactccgggc 480 

cgaggctccg aggcgatcca gLcctgattc ccccgcnacc gctcgagctc Ctgctcctgc 540 

gcctgcgccg tttggctcgc cagccgcgcc gccacttcag gtccagggtg oacgcatgcc 600 

cccaggcgcg ggcgccctgc gagtcggcct cgcagctctg tggaagctgc acgcggcttg 660 

tcggaaaatc aaggcgztct gagfctrtaga tggttaatag caggtccctc ggtgtctgca 720 

gtcgacgaac gactggtgta ggcgtttgct gtgagaatgg agaatgcagg ggaacgcccc 780 

tgactgagaa gcgggccctg ggaaacgatt gtgaacgcgt gaatgaattg argactaaaa 840 

cccgctgcgg gggtcctaca gcgcagatgg taatgccgtt ctgactggct gggaacggca 900 

ccttagcaga tacttaaaag, gcgccttctg tgtgccactg tcactgccaa cttggtgact 960 

catttaaaac tcataaccag ccggtgaggt cggtacztcg ctcctcctca ttctgcggag 1020 

gggaaagcag cacggaaatg. ccctgtgact ggcagcggaa aoggcgacca ccgcttgtgt 1080 

gtgggegtcc cgacgcccgg agggggcagg agtctccacg ggtcctggga cagagctcac 1140 

ctgttttgtt ttgaattaca cttatttata. tgcaactaca ggcctgacgc tagcggtgaa 1200 

gaaggcagat acagcctttt aaggagttgg cagatgagtg ggagagagaa aactaatctc 1260 
attatcggcc acaggctgtg gtcagtgttt tgaaggaaaa gtacagggat ' gtttggcaac ' 1320 

tgtggtatfct caggtttgac cttaaatcct cacttaaacc agtttttaca aggactggtc 1380 

taggcgcccg ggcgcggtgc tcacgcccac aatcccagca ctttgggagg ccgaggcggg 1440 

cggatcacga aatcaggaga tcgagaccgt cgtggctaac acggtgaaac cccatctcta 1500 

ctaaaagaat acaaaaaatt ggccgggcgt ggtggcgggc acctgtggtc. ccagctattc 1560 

gggaggctgg ggcaggagag tggcgcgaac ccgggaggcg gagctttcag tgagccgaga L620 

tcgcgccac.t gcaccccagc .ctgggcaaca gagccagact ccgtrtcaaa aaaaaaaaaa 1680 

aaaaagggcg gccg=zctag aggatccaag cttacgtacc, cgtgcatgcg acgtcaa'tag 1740 



<210> 22 

<211> 2876 

<212* DMA 

<213> Homo sapiens 



<400> 22 : 

ccacgcgtcc ggcggcgagg cgccggtggc ggctgagtcc gtggtggcag aggcgaaggc 60 

gacagctcta ggggttggca ccggccccga gaggaggatg cgggnccgga tagggctgac 120 

gccgctgctg tgtgcggccjc r.gctgagctt ggcctcggng tcctcggatg aagaaggcag 180 \ 

ccaggatgaa tcctitagatt ccaagactac cttgacacca gatgagtcag taaaggacca 240 

caccaccgca ggcagagtag ttgctggtca aatatttctt gacccagaag aatccgaatt 300 

agaatcctct actcaagaag aggaagacag cctcaagagc caagaggggg aaagtgtcac 360 

agaagatatc agctttctag agtctccaaa Cccagaaaac aaggactatg aagagccaaa 420. 

gaaagtacgg aaaccaggca gtctggacat fct-.-rctcgct tttcgattta tccaggggac 480 ' 

aactgaaaat tttaagctaa tgaataaaga ggctgaagaa gactggcttc actcattatt 540* 

acccacaaat aatatatgga gtgtagtttg gagagaattt cnagatctta atataccaaa 600 

gttattcact zaacagatcc ga'cccaagtt acacaagctg aattccatag atgagaccgc 660- 
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atttatcttt ttatcaatat cctgacgttt ctttt-ggaa tcaagtatgt aaataccttt ■ 720 

ctcactgaga aatttaaaaa aataatatgc . cagggcagca cacaaaacag aaactcagga 780 

agaactgtct aacatgccac tatggttatt ttcaggatzg tggtaggatt attaccacaa " 840 

taacctcgtc gatcatccta ggcaattcga tccaggtgct tacccctcag ctgacgctcg 900 

tgcgcacwtc catcctattc taactttcac aatcttgaaa cctctacttt tcaatttaaa 960 

acttgacctt tttcacaata gtctzc'taaa tattaatcct tttaaaaatt attaar.gtat 1020 

gcattctacc angtcttgtt gcccttctgt catttactat ttcaatgatc actgccttat 1080. 
ctgtttatta ttccatcact aaccactctc aggaactgat ccaaattgga acLttttaaa ' 1140 

aaaatagaat r.tttttttac tacaaatata cca'ttctcat gaatagaaaa ccatttttaa ,1200 

aaaattaaaa gtacatgaat cccatcaact agaaataaca-gctgctcaca ccatggagaa 1260 

tattctccca ggatttcttr aggtgtatca gtagttttaa gaaaagtaaa ttgggtcaaa 1320 

atgtgtattc tatgtfcgcag cctgcatttt gcact'taaca gtttaccatg aatgtttttc 1380 " 

catgtcattt agctiat-tc caatttgata gttaatggct ataaaaaatt ttattcgtaa 1440 

, atatatggta ccataaacca aaacgtttat gt tttcctgg ;gagatcactr tagatgtatc 1500 

tccgcgccta tctgtgaaaa nttccttaga attcttaaag taaatttgcr gaattggatg 1560 

taaaagttta aaagacaqta .gatacacatt gctaaattgt tcttcagaaa agtgattccg 1620 

tttacttcct ctgaaaatac ctctttccct gcaccccacc caagtgtigta ttaccactrt- 1680 

aaaatttcta ctaaatataa tttaattcar tcttcatLtt ttaactaggg at~ttctctt 1740 

ttttttttta gL'^uttacat ccccagagac . atggtc^cac - tctgtcactc aggctggagt 1800 
gcagcggcac gatcatagct cactgcagcc ttgaactcag gtg'atctccc tctttggccf. * .I860 
cccaaagtgc tgggattaca ggcatgagac accaca'tcaa. gcctgggaaa cttttaaaca ' - 1920 

tgcataaaag tagagggtaa aatgacaccc cttaccaacc ctcactcagc ttcaatoaac I960 

atcaacaccc tgccggtctt ttcatcttta ctaccccaca cacacttgcc ctttttttcc 2040 

ttcctagaat actttaaggc aaatcccgca tgtgctttta catctctgtg tctctctcta 2100 

gtaagaatgt tcttttcagc aaaatcacag aattagccat tactccttta ttacctaata 2160 

cccagactgt ttaatttttc tagttgcttc aagtgtctgt ttacaattta cttgtttgaa 2220 

, ttcatttcta agrtactcag gaggctgagt caggagaatt gcttgaaccc aagaggcaga 2280 

ggttgcagtg agctgagatc gtgccactgt actccagc'ct gggcgacaoa gtgagactcc 2340 

atgtcaaaac aaaacaaaac aaaacaaaac aaaacaaaac aaacaacaca attgaataat 2400 

acccctcagt tattatttag agaagagatg : actcaatata aatgtaatat gtgatcactt 2460 

ggtcacagaa tgaatztcag -cattgctta ggtaagcctg gr.aaaaaaaa aaaaaaacgg 2520 

gattccatga. ttagtgtgtt gaaaaggggc ttgccaaagt gtaacagaca gacccgggtg 2 580 

cagtggctca cgcctgtaat cccagaactt tgggaggcca aggcaggcgg atcacgaggt 2640 

caagagattg agacaatccc ggccaacatg gcgaagccct: gtatcaacta aaaatacaaa 2700 

aacaattagc tgggcgtggt gacatgtgcc tgtagtccca gctactcagg aggc-gaggc 2760 

aggagaattg cttgaacctg ggaggcggag gccgcagtga gctgagactg ctccactgca 2 820 

ccccagcctg gccacagagt gagac^ccat ctcaaaaaaa aaaaaaaaaa aaaaaa 2876 



<210> 23 

<211> 1052 

<212> DNA 

<213> Homo sapiens 



<400> 2 3 

tcgacccacg cgtccgccca cgcgtccgcc cacgcgtccg ggcgggcgca ggacgtgcac 60 

tatggctcgg ggctcgc-gc gccggttgct gcggctcctc gtgctggggc tctggctggc 120 ' 
gttgctgcgc tccgtggccg gggagcaagc gccaggcact; yccccctgct cccgcggcag ■ 180 

ctcctggagc gcggacctgg acaagcgcat ggactgcagc acctcctgcc cccttccggc ■ 240 

,-tgctttggcc catccttggg ggcgctctga gcctgacctt ' cgtgctgggg 'c'tgctc.ctg 300 ' 

gctttttggt ctggagacga tgccgca'gag agagaagttc accaccccca tagaggagac 360 

cggcggagag ggctgcccag ctctggcgct gatccagtga, caatgtgccc cctgccagcc 420 

ggggctcgcc cactcatcat tcattcaccc actccagagc cagtccctgc ctcccagacg 480 

cggcgggagc caagctcctc caaccacaag gggggcgggg ggcggtgaac cacctctgag 540 

gcctcggccc agggttcagg ggaaccctcc aaggtgtctg gttgccctgc ctctggctcc 600 

agaacagaaa gggagcrrca cgctggctca cacaaaacag ctgacactga ctaaggaact 660 

gcagca-trg cacaggggag ggggtgccct ccttcctaga ggccctgggg gccaggctga 720 

cttggggggc agacttgaca ccaggcccca ctcactcaga cgccccgaaa ttccaccacg 780 

gcggccaccc tggggggtta • gggacctact tttaac'acta gggggctggc ccactaggag 840* 
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ggcrggccct aagatacaga cccccccaac tccccaaagc ggggaggaga tatrtatctt -900 

ggggagagtit tggaggggag ggtggggggg gggaaaaaaa aataaaaaaa aaaattttta 960 

atttttaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaaaaaaaa aaaaadddaa aa 1052 



<210> 24 

<211> 1541 

<212> DNA 

<213> Homo sapiens 



, <400> 24- 
ggcacgagct 
get gga tang 
ctattiggcc 
aacttaatca 
agtcagctct 
ataattccaa 
ttaagtacai: 
atgggaccaa 
atacaaggtc 
tattcaagga 
actctcctgc 
gaatgctcct 
tccgtggacc 
gggctgtact 
taagtatgaa 
ccaaacttgt 
ttttttattc 
atgaattaat 
ctatgaaatc 
tcaaaaggaa 
ttttctaaaa 
tttgtaacct 
cattcccttt 
atatcttggt 
gcttgttctg ' 
acaaacgtct 



:cctcgatcc 

cgggactagt 

ncctaagtga 

cagccgggtg, 

ggctgctggc 

ctttcccgga 

tgggactcgc 

cgctgggcgg 

tatgggctct 

gaaaaagg tc 

ctaatgaaac 

ggcct-aaac 

ccgtgccagt 

tactgtgata. 

aggagtaac t 

ccttaaaga t 

taagtacacc 

tcgatatgcc 

agtttttaaa 

gaattcttgc 

attctgtttg 

acgttcactt 

aatgtctctg 

agatgtattt, 

ttagggaagg 

attgtgtaac 



tactctgtct 
atncctgggt 
taaaaggcca 
ctacatgctr 
•^jctgatatta 
adttctcagt 
atcaggcctt 
atttctgtat 
ga taagtgga 
caaatctcaa 
ctagtccgat 
atcaccgtag 
gtcttggaag 
ctgaaaagct 
aaaaacaagc 
gttgttatta 
ga.ttcngtga 
ctaaatacat 
agatatactt 
ctgccataca 
tgtgaaaagt 
atgatctgga 
CGCtttggct 
'catcccLaga, 
gcttcctcca 
aaaaaaaaaa 



coctttgttt 
acggcactgt 
cctctaagga 
ttagggcetg 
gttgtaagtg 
tgrgcacatg 
cttagtgcaa 
gagaaaattg 
ttagccatg'g 
aacatcctca 
ggatcctgga 
gaagggtttt 
tgtcaacgtg 
gtcctgctga 
aaaacaaaac 
actcgagtta 
atgtaccttt 
aaaggtgctt 
tctcttcgtc 
gaaactccct 
acaagaataa 
tttataaaca 
ctaccatctg 
gcaggtcagc 
acttcgtgtg 
aaaaaaaaaa 



tggagaagtt 
cctatgccat 
aacggcttct 
tcccaatctt 
gcctctctgc 
aaaatgggtt 
tgtggtcaat 
gttttgaatg 
gcttgtctta 
gcacagagga 
tigatacaag 
taaaatttta 
ttcttggatg 
agcagctata 
aagacceagt 
gttcttattt 
tttattacac 
caaaatatgn 
rrtgaggtttt 
agcactccct 
caatacttac 
ctacctggta 
ctttgttttt 
ctcctt'cccc 
aaattgtgat 



caatttacca 

ctcttcacca 

ggtgtttggc 

gcatattaaa 

tggaatgagt 

tgaagaggga 

tggtgctttt 

ggcagcagor. 

tctactggag 

ggaacgaact 

gttgagaaat 

cgcgcaaaac 

atcctgtatt ' 

tttgaaatat ; 

ttttaaatga- 

cctctgttca 

agggaaagaa 

agaaacatta 

tcggtcttgt 

gaccttaagc 

aacctccatt 

taacgttttt 

gtttttatct 

taatgcgaat 

gttgaagcga 



60 
120 
' ,180 
; 240 
300 
360 
" 420 
: 480 
540 
, 600 
•660 
. 720 
780 
840 ^ 
900 
960 
1020 
1080 * 
1140 . 
1200 
1260 
1320, 
1380 
1440 
1500 
1541 



<210> 25 .. 

<211> 2079 

<212> DNA 

<213> Horio sapiens 



<400> 25 

ggcacgaggg 

catgaggatt 

ccgcagtctg 

ccgaaggatg 

gcagcaaggg 

cga'ggtgtct 

ctcttttctc 

gatcaLtacc 

ggagagtgga 

gcttatctcc 

tacagcccag 

gagagcccca 

cagaccagag 



aggcgcggct 
caacaacccc 
actggaggaa 
gctatgagaa 
ccctga tgcg 
ttcagagagc 
tcacagggga 
acgccattcc 
agacagagga 
gggcaggaaa 
ataataagag 
accacg cggt 
acacccacga 



tgtggctaac 
tgcggcacat 
ggtggagcaa 
gctgcaggac 
cactggctct 
tgactgcttc 
atggctggag- 
tgaggcaagt 
tgtttcgcgc 
gatggccagg 
agctgaggcc 
ggggctatgt 



actagattct 
cctcLycLCg 
agtctggagg 
gaccttccag 
gtgtacatgc 
gccatcactg 
caagc.ggca 
gaggctgtca 
ctagaagatg 
gccctcagcc 
aatgtcttga 
gtcatccaga 
cagaccctgg 



acgacaaggt 
catttactct 
ccagtgagaa 
cccttgagga 
tcaatgtgaa 
acctgtacag 
aggtggccta 
gtcticttccg 
ccttggatc'a 
tctctcggga 
aatatgaaag 
ggcccaatat 
gctcccagcc 



actt tctctg 

catcaaacgc 

ca-ccgagct 

ccctgaggga 

aggcctggcc , 

ccccaaacgg' 

tgacatgggg 

aggatcttac 

cttggccttt 

gtttcttctc 

gcdcctggca 

accccacctg 

cacccrccac 



-60 
120 
1B0 
240 
300 
360 
420. 
480 

.540 
600 
660 
720 
780 
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cagatcccta* .gcccctactg ttcccatgag accaattcca acgcctacct gctgctccaq 840 

cccatccgga aggaggccat ccacctggag ccctacattg ctctccacca cgacctcgtc 900 

agtgactcag aggctcagaa aattagagaa cttgcagaac catggctaca gaggccagtg 960 

gtggcatcag gggagaagca gttacaagtg gagtaccgca tcagcaaaag cgcctggccg 1020 

aaggacactg ttgacctaaa actggcgacc ctcaaccacc gcattgctgc cctcacaggc .1080 

. cctgatgtcc ggcctcccta tgcagagtat ctgcacgtcg tgaactatgg catcggagga 1140 

cactatgagc ctcactttga ccatgctacg tcaccaagca gcccccccta cagaatgaag 1200 

tcaggaaacc gagtcgcaac a-ttatgatc tatctgagcc cggtggaagc tggaggagcc 1260 ' 

acagccttca tccatgccaa cctcagcgtg cctgtggtta ggaatgcagc actgttttgg . 1320 

tggaazctgc acaggagtgg cgaaggggac agtgacacac ttcatgctgg ctgtcctgtc - 1380 

ctggtgggag ataagtgggc ggccaacaag rggatacatg agtatggaca ggaattccgc \ 1440 

agaccctgca gctccagccc tgaagactga actgttggca gagagaagct ggtggagtcc , 1500 

tgtggctttc cagagaagcc aggagccaaa agctggggca ygagaggaga. aagcagagca -v 1560 

gcctcctgga agaaggcctt gtcagctttg, tctg^gcctc gcaaatcaga ggcaagggag 1620 

aggtcgtcac caggggacac tgagaatgta cactcgatct gccccagcca cggaagtcag 1680 

agtaggatgc acagtacaaa ggagggggga gtggaggcct gagagggaag tttctggagt 1740 

tcagatactc tctgttggga acaggacatc tcaacagt'ct caggttegat cagtgggtct , 1800 

tttggcactt tgaaccttga ccacagggac caagaagtgg caatgaggac ac^tgcagga ,1860 

ggggctagcc tgactcccag aact t Laaga 'uctLCLcccc actgccttct gccgcagccc 15120 

aagcagggag tgtccccctc ccagaagcac accccagatg agtggtacat tatataagga 19B0- 

ttttttttaa gttgaaaaca . acttcctttt ctttttgtat gacggttttt taacacagcc 2040 

attaaaaatg 'tctataaatc aaaaaaaaaa aaaaaaaaa . - , ■ * . 2079 



<210> 26^ 
<211> 1947 
<212> DNA 

<213> Homo sapieris 



<400> 26 * 

tgtaaacaga ttggagaacc tagcaataag attcaaagct aatctggagc ataaaggcac 60 

agttcagaga cagaacaaca gggatcacaa gcacgaatta aaaggaattt attcgcttca . ; 120 

agttcctaga tacaaccttc ccatgctgca cctctccact gtcggagcac. gttccgaaaa, < . 180 

acagaatgcc tcgatccctg gtgggtgcga aggcagtcgt tagggatggc aggcattggc , 240 

gggctccaaa agatgaaggc cccacacaca ggtgtgctgc atttgggatc tgtgtgggtg \. 300 

tttcttggac cctttcttct gggagtaggg tacacactaa cgtttaatcc gctgtctggg 360 

tgcatgccca cagtacggtg gctaaactcg aacatcactg caaataggac gctgagcagg -/ 420 

tccgtctgtc atgtcacgcc actgcacagg - tccttgtccc cacacgacgg ggagtacttg .480 

cgtcagatgt tattgaacag ctcgtctcgg gcaggggaag cggggagttg gggatattaa 540 

ttgggggttt taattctatc atcatgtcag ctgacattat gactatataa tgtagctaga 600 

gacaattttt atcttgctta tagtaaaggt tcagcctgcc aactgtaaat cattctaatc ■ 660 

tggcaggctt atttttgaca ttggaaaggg. cagaaagcga tttgccccag ; tagtgtaata ,.720 

ggagttatag .accagaggct gaaacccaaa ctatataaaa aggaattcag -ggagggggc ' 780 

tttgtoatct ccattaatt: t gtgttgctac ttccaggatc accaaaaacz acacgcaact 840 

tcacatgtta aacacattga aacataacct atgtttataa agcataacgg gcttcccttc 900 

cagaagctct cctgcttgtc atgaagtgaq aacaatqaaa agtcatagca gatactcagt 960 

ttaactctgt gtagaaccrta gtagtgtttg agctgttact cagartcgaa ttcagactgt 1020 

gtgttgtttg cttacggaca ctgcctgtcg tnctgtcact gttaoattaa tgagtctata 1080 ' 

aggttrttct tccagaggcc ataggtgaca tcactaaaat tgcaagataa at:tgcaatct 1140 

tcgctgctgc >tgcactcccc aacctctccc ccaccccccg tggtgtgctg ccttctagat 1200 . 

gagcgtgttt tggagcaggc ccatctggga cactccacgc ttccaccaag gaagr.gcgat ■ 1260 

ctgagcagcc acaatccagc caaaagagga ccgtagatat ttgctctgat caactagatg 1320 

aaaatatagc agaatggatc ragcccactg ctctgtttta tccaactgag cctc^gacca 1380. 

gcaattggtg cataactact acagcaaaag ttaagaaatg aaaccgtagc aattatgcaa > 1440 

atgaatgtgt tggcctccta atacctgtta ctagtggact tcctgtgagg aagctagttc 1500 

tttgttttga tgaaatgccc tcgtttttta aar.ctt-.aat.r. rr.gctgrcca catcctccca 1560 

aagcgtgct" acttcatttg trtaattcaa atgaactttc ctccttgtat gtatgaggtg 1620 

acttgg-ggg tggggtgggt ggctcttgtt tttgtgtttt. ccctttctta gggbatctgz 1680 

aggcctcaaa ggacccctcc ct taggtcat ' actcctcaga aagrcctcaa ccttcccttg 1740 
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tcttcgttzg , tctgtttttc ctaaagaata tttrcaaagc CCaaatttgc atattaat^t 1800 

aggaccat-t agaagtatag gctgtcgttg gcggcagcag tatattctga aacgtctcat i860 

agatatatat rtttgaataa agatgg-gtt g^tgaacaaa aaaaaaaaaa aaaaaaaaac 1920 . 

tcgagggggg, gcccggtacc caattcg 1947 



,<210> ; 27 . ' • 
<211> 3379 
*212> DNA 

<213> Homo sapiens- 



<400> 27 , , • 

atgatattca agccgaattc ccctcactaa agggaaaaaa gctggagctc caccgcggtg 60 

gcggccgctc taaacctact ggatcccccg ggatgcagga,-attcggcacg aggtctcggc 120 

tccctccggu ac-ccgcccgg cccggccgcc cccggcccct . cccggccatg gggagctgcg . 180 

cgcggctgct gctgc-ctgg ggctgcacgg tggtggccgc aggactgagt gqagtagctg 240 

gagtgagttc ccgctgtgaa aaagcc'tgca accctcgga- gggaaatttg gctctggggc ■ 300 

gaaaactctg ggcagacacc acctgcggtc agaatgctac cgaactgtac tgcttctaca 360 

gtgagaacac ggatctgact tgtcggcsgc ccaaatgtga caagtgcaat gctgcccarc 420 

ctcacctggc ccacctgcca cctgccacgg cagactcatc cttccggtct ccccgcacat. 480 

ggtggcagcc tgcggaggac gtgcacagag aaaagatcca .g'ttagacctg gaagctgaat . 540 

tctacttcac ticacctaatt gtgatgttca agtcccccag gccggctgcc atggtgctgg ; 600 

. accgctccca ggactttggg aaaacatgga agccttataa gtactttgcg actaactgcc 660 

ccgcracatt tggcctggaa gatgatgttg tcaagaaagg cgctacttgr acttccaaac * ' 720 

actccagtcc ttttccatgc accggaagga aggttatttt caaagctttg tcaccaccat 780 

acgatacaga gaacccctac ag-gccaaag ttcaggagca gctgaagatc accaacctcc 840 

cgcgtgcagc tqctgaaacg acagtcttgt ccctgtcaga gaaar.gacct gaacgaagag 900 

cctcaacatt ttacacacta tgceatctat gatttcattg' tcaagggcag ccgctcctgc 960 

aatggccacg ctgatcaatg catacctgt t ' catggcttca gacctgtcaa ggccccagga ' 1020 . ■ 

acattccaca ' tggtccatgg gaagtgtatg tgttagcaca acacagcagg cagccactgc 1080 

cagcactgtg ccccgttata caatgaccgg ccatggcagg cagctgatgg caaaacgggg li40 

gctcccaacg agtgcagaac ctgcaagtgt aatgggcatg ctgatacctg tcaettcgac 1200 

gttaapgf.gt gggaggcatc agggaatcgt agtggtggcg tctgtgatga cr.gteagcac 1260 

aacacagaag- gacagtattg" ccagaggtgc aagccaggct tctatcgtga cctgcggaga. '1320 

cccttctcag ctccagacgc t-gcaaaccg tgttcctgcc atccagtagg atcagctgtc 1380 

cttcctgcca actcagtgac cttctgcgac cccagcaatg gtgactgccc ttgcaagcct 1440 : 

95ggtggcag ggcgacgttg tgacaggcgc atggtgggat actggggctt cggagaccac 1500 

ggctgtcga'c caegtgactg tgeagggagc, Cgcgacccta tcaccggags. ctgcatcagc 1560 

agccacacag acatagactg gcatcatgaa gttcctgact Lccgtcccgt gcacaataag 1620 

agcgaaccag cccgggagtg ggaggatgcg caggggcett ctgcacttct acactcaggt 1680 

aaatgcgaac gtaaggaaca gacattagga aatgccaagg cattctgtgg aatgaaatat 1740 

tcatatgtgc taaaaataaa gattttatca gctcatgata aaggtactca tgttgaggtc 1800 ■ 

aatgtgaaga ttaaaaaggt cttaaaatct accaaactga agattttccg agggaaagcg 1860 

aacattatat ccagaaccat. ggecggacag aggatgcact tgtccaatcc tcaatcctgg 1920 

tttggaatac ctcgtagcag gacatgagga tacaagaaca ggcaaactaa ttgtgaatat 1980 ' 

gaaaagcttt gtccagcact ggaaaccttc tcttggaaga aaagccacgg atattttaaa -2040 ' 

aagagagtgc aagtagcatt aagatggata gcacataatg gcactrggct atgtccaaaa 2100 

cacaaacttt agagcaacaa gacctcagac aggaaactgg aatttcttaa agtgccaaaa 2160 

catatagaaa tgtttgaatg catgggtctt Latctaattc atcccttctg gacccatgtt 2220 

taaatacagt tccatttcat gaagagaaaC gaaaacccct acactgatat ctgttttcta 2280 

tgg?ractgat tctgaaattc ttaactatta " agaatac- tt aacagcagca tgacatttag . 2340 

cagtaatcca ttaagggcag cacctctaac aaggacgcct tccagcttca gccacgttac 2400 

tcacgtttga tgctacttaa agLaatgaat gacg-cttaa ggaatcccta acccnactat 2460 

cagaaa'aggt gttcgttaaa gagccttctc ttgtgcgtca cgcacgaact rcgctctgta 2520 

ggtgttaaat ggaccctctc catgtgtata tagtatttcc ttgcataaag cactttacta ■ 2580 

cctaccactt gtgttgtgaa cgrttggtga ccgctgttga aagaaggaaa agggtgtgtg 2640 

agaaagccta ccgaagcagc agcactgcca ccacatgtgg acaaaagtga acatacaaaa 2700 

gaagttgtgc ' tattcaactc .tgaatacttg gagaoactag gtgaagatgc aaccagaaag „ 2760 

gagaata^gt acgcgLgaag Lctcagctcc gagctggagg ccagattcca agar.gacagc 2820 
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cacgatgaaa ccttctaaaa aactaaacca gaagagactt taaaataaga gaaagaaatc 2880 

ataaatgtag acatatgctt ggctaaaggg gaaatggact ttaaatttta aagagctcat 2940 

ttgcaatigca cLtgtataca cttcaaaaat tattgtagac acagaatttg ttatattttt 3000 

gtgcttagta tttaascctg aacattgaaa cagttttcct ccttgtcctt ctcaacagta 3060 

acagtcatta catttacctg tttcccaaca caacgcacgt gacagt:caaa aaatcacagt 3120 

ttctcattac tatccatcct ctgtacccac gcataaccac catacatagt ttcttttgta 3180 

ctcgaatata caaaacacga acacagtgcc acacgaataa -ttcacatac agaacctittt 3240 

tttctctgaa gccctigtgga cttgcaaata -atatatata tcgccttgtt aatctgtttt 3300 

tatactccac atatgtaata aaggaatafcg atctgaaaaa aaaejaaaaaa aaaaactcga 3 360 

gggggggccc gtacccaaa 3379 



<210> 28 

<211> 2006 

<2X2> ' DNA 

<213> Homo sapiens 



' <4O0> 28 

cccacgcgtc cgcggacgcg tgggattttc ' LcLtaagaaa aaaaaaggaa gacgttttaa • 60 

taggaaaaga aaaacaagaa gcagtagcag ctaggtaatt , taaacgtctt cctttttttt . • 120 

tcttaagaga aaatggaaca "tttaggctaa atgtctt taa attttaccac ttaacaacac 180 

tacatgccca taaaatatac ccagtcagta ctgtatttta aaatcccptg aaatgatgat 240 

atcagggtta aaattacttg tattgtttct gaagtttgct cctgaaaact actgtttgag t 300 

cactgaaacg ttacaaatgc ctaataggca tttgagactg agcaaggcta ctzgtLatct 360 

catgaaatgc ctgttgccga gttatcctga atagaaatat tttaaagtati caaaagcaga ' 420 

tcttagttta agggagtttg gaaaaggaat tatactcccc tttttcctga ttctgtactc 480 

aacaagtctt gatggaatta aaatactctg cr.ttattctg gtgagcctgc tagctaatat S40 

aagtattgga caggtaataa tttgtcatc- ttaatattag taaaatgaat taagatatta 600 

taggattaaa caraatttta tacggctagt actccattgg ccgacctaaa tttatagcgt 660 

gtggaaattg agaaaaatga agaaacagga cagatatatg atgaactaaa aatatataca 720 

ggtcaactct ggtctgaaat ccctgaggtg tttttaacct gctacactaa tttgtacact . 780 

aatttatttc tttagtctag aaatagtaaa ttgtttgcaa gtcactaata atcattagat 840 , 

aaattatttt cttggccata gccgataatt ttgtaatcag tacnaagtg:: atacgtattt 900 

ttgccacttt ctcctcagat gatcaaagta agtcaacagc ttatttcagg aaactggaaa 960 

agtaataggg aaagagactt cactatttgc ttcatcagtg gtaggggggc ggtgactgca 1020 

actgtgttag cagaaattca cagagaatgg ggatttaagg ttagcagaga aacttggaaa 1080 

gttctgtgtt aggatcttgc cggcagaatt aactttttgc aaaagtttta tacacagata . 1140 

tttgtattaa atttggagcc atagtcagaa gactcagatc ■ ataattggct tatttttcta 1200 

tttccgtaac tattgtaatt cccacttttg tdataaLLti ' gatttaaaat ataaacttat 1260 

ttatttattt tzttaatagt caaaaatctt tgctgt-gta gtctgcaacc tctaaaatga 1320 

ttgtgttgct tttaggattg atcagaagaa acactccaaa aattgagatg aaatgttggt x 1380 

gcagccagtt ataagzaata tagttaacaa gcaaaaaaag tgctgccacc ttttatgatg * 1440 

atcttctaaa tggagaaaca cttggctgca tccacataga cctttatgtt ttgttttcag 1500 

ttgaaaactt gcctcctttg cjeaacattcg' Laaatgaagc agaatccttc "tcc-CtCCt 1560 

ttccaaatat gttagcttcg ttcttgtaag' atgcatcatg ggtattggtg ctgtgtaatg 1620 

aacaacgaat tttaattagc atgtggttca gaatatacaa .tgttaggtt- ttaaaaagta 1680 

tcttgatggt ccctttctat tcataatttc agactttcat aaagtgtacc aagaatttca 1740 

taaatttgtt ttcagtgaac tgctttttgc* tatggtaggt cattaaacac agcacttact 1800 

cttaaaaatg aaaatttctg atcatctagg atattgacac attccaattt gcagtgtctt 1860 

tctgactgga tatattaacg ttcctctgaa tggcactgat agatggctca gaagagaaac 1920 

tcaacgaaat aaagagaata ttrattcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1980 

aaagggcggc .cgctctagag gatccc 2 006 



<210> 29 . ■ 

<211> 3070 

< 2 1 2 > DNA 

<213> Homo sapiens 
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<400> 2 9 • ! '* 

gttggagcct tagacctccq aagtgaccta tagaaggtac gcctgcaggt accggtccgg 60 

aattcccggg tcgacccacg cgcccgcaac gtatgaggag cctgatttet tcacatcctt 120 

gtcaacatct gttgttgttg ttgctgttgt' tgtttttaat tttagctatc ctagtagatg ,180 

-gaagtggta .tctcgcgctr. cttatmtgca tttccctaat gactagtgac gttgaacatc 240 

ctttcatgtg cttgttggcc attcgtatat 1 cttcttggag aaatgtctat taagtccttt 300 
gcccaacttc aaattggctt ■ g/rctrtttgt tgagttataa ccattgaatt ttaattgctg * 360 

taaaacactc catgagtaca tgtagca-gc actcaataac cattagctaa tactatagtt 420 

gtaactattt ttgccacatg tggcctgaat tggaaggaaa ttgcccagtt cttccaagta 480'. " 

ggagcggaga tgtaacaaaa cctgtattta aggccaacaa agatttgact ctgtcagtgt 540 

tgatactttg cttutaattg ' atgctcccaa atcaagaaat . tagaaacgtg tatacataag , 600 

aatcagttta tct~tggcag i aaaccagtat attatcaatg r.ttaattaat tcctaagaat 660 

taatggcaat agtgcc-.cta gttttagctg agatttagac catcttccct catcctgccc 720 : 

tgttatgcga catttgctat aaatgccaca,atgagaatcc cctgaaacct gggtcagaga 780 

agcctaccct uatatttaa taagtatgtg attgatgtat tatcattggt ttatgcttat 840 ' ' 

tgcatcttgg ttttatgggg tgttcaaaag aagatagatg agaataggtg atccaaagta 900 

atttagaact ttatcctcca gttgggactg aaccatcttg aactgatgaa tatattctac 960 

caggtttata ggtagatatg ccgtttcatt atatcggtga tttggagtga gacagtagct 1020 

tttnagcaga acaacagcca tatgtaayLc LaaatLtggt tattttcttt gaataagcaa . 1080 

cggaaactgt angcattcag gaactggtga tccagatatc ttctgaaggc cctctctccc , 1140 

ctgtgccccc acagcttttg- ccctgaactc tcctgtctcc ttgcgatgcc tcrgtgccta 1200 

tacctacctz gtgaacictg ccrgcaatgc 'cgtcccaagt cagttggtga ctcctcaaac ' 1260 

ctcactgaag tctttcgggg tggattcaaa tgctgctttt agccattaaa atctgrtaca 1320 
tctgttacat actttcacaa tagcacctat- cctagtgtag tettaaaaat gcactcaaaa t 1380 

atttttaaag cagtgcatgc atgcagtaaa caaaattgga cagtacagaa ataaatggtg 1440 

aagacagtct cccagttttc ttccccatgg atagcactct ctccrtttta cccttccaaa 1500 

gatattttgt gctgatacaa gaacr.r.ttat, accgtacgaa gtgcctattc atcttatctt 1560 ■ 

ttacttcagg gtaatttatg agtaatgaaa aaattcaagc attacaaaaa tgtagagtgt 1620 

gtgoagcaaa aggtccccrt tactttgacc cctaacacta gcccttccca aataacactg 1680 

tcctaattaa cactcccccc tccagaagta acctctgtta taagccttgc ctgtgccttc ' 1740 

agattgttct gcttacactt afcatgcctat ttaaaagtgc atatttttgt actttgcttg 1800 

tttgtttgtt tgagacagag tctcgctctt gttgcccagg ctgaagtgca atggcactat 1B60 

ctctgctcae tacaacctgt gtctccctgg ttcaagtgat tctcctggct cagcctcctg 1920. 

agtagctagg atcacaggtg cacgccacca tgcccggcta atttt tgtat ttttagtaga 1980. 

gatggagcct caccatgttg gccaggctgc tcacgaactc ctgacctcag gttatccacc 2040 

caccttggcc tttcaaactg ctgggattac aaggcatgag ccactgtgcc cggccaagaa 2100 

tttrtt^atc gataccatag tgagctctct gcctettcgg aacgatgtcc act ttgctta 2160 

Cgatcaaccc aagcaggact cttc't=tccc tggacgcctc tccccnggtc tggaatcttc 2220 

cagtictgcc agaattggcc tttcccagat.gctgcaaact' tccagugaa cccctrtttc 2280 

tgtgtggccc crggggctgc gagaccaaaa tccatgagtt ctgtgtaccc tagacctttg 2340 

gaaggtgaga gcagggccct gagaaaaggc aaaacgccrc ctctccctga tgatacccct 2400 

acccgcccta ctcctcacca gaattgtcag tggcetttca ccacagtggt cctt'cctgcc 2460 

tgagccctgc actgtcccag accacacaga agtctggtca cctctgggcg cctgggatgg 2520 
tcaccgaaga gaagcacgt'U ytccccgtct tctcctggct tccgccagaa aatcgaacaa . 2 580 

gzgcaattaa cacactgcta ctgccgaagc ctgaaactcc caggacrtgt ccttgatcct 2640 

tccagaaacc accaggtccg gcacttggag ccccccggac agggacctcc cagccgagcc 2700. 

c-caaagaac tccatgaaat caggaactgc ttgatgaaat gcatctcctt gtacctggaa 2760 

gatgaagccc aaacacccac acctctgtct cccccagggc tcgggatgtc tccagcagcc 2820 

,cggccacgca gcttcccagg tgggctcggg gaggtgggag' cagggaccat ctctgccccc 2880 

cccacectca ctccatccac ctcggagacc accccccccc agccagatac ggaataaaac ' 2940 

tacagacgca gaaaaaaaaa aaaaaaaaaa gggcggccgc "tctagaggat ccctcgaggg 3000 

gccccagcct acgcgtgcat gcgacgtcat agctczcccc ctacagtgag tcgcattata 3060 

agggagcaaa * ■ •' '3070* 

<210> 3 0 . ... ^ 

<211> 2 227 

<212> DNA . : 

<213> Homo sapiens 
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<220> 

<221> SITE , - f . I 

<222> (289) 

<223> h equals a,t,g, or c 



<400> 30 - , " 

cggacgcgtg ggtvr.gaccca cgcgtccggg aaaaarggaa aaracgccgt gtaaaatctc SO 

gttctgtgtc tgaattgccg taggctcaga ccticattcg aggttctgtg tctgaategc 120 

cgtaggctca gatcttcaLt Lgaggttatg ttctataagt taacgctgac cttgtgtgag 1B0 

cttccggtag ctggagtaac acaggcggcc tcacagcgac ctctccagcg ccttccaagg 240 

cacatctgca gccagcgtaa ' tcctcctggg agatqcctcc tcaaggccnt gccccagacc 300 

acgtggggar ggcctgacar ccaatcccca ; ggctgtcccc accctcgrag agtgacccta 360 

aacgctagac agatggggaa tgggaaagaa aagaaagctg cagacctcaa gttaa'aattc 420 

cctcaaaaac gctcLLat_tL' atctgcttct tctgaaagga taaaggctct ttgaaaatta . 480 . 

ccttctaaca aataacatga acacttctag aaaccctaga aaaacacaaa gcattcaaaa 540 

tagaaagaaa aattacccat tactctttaa gccagcatta tccattgcgg igcttttgga 600 

gxtgggtgag ^ccgtagcct ctgccaagtc aaggagcccg gtggtggctg tggcattcct 660* 

gcagggttgt Ctttttttct ttgagatgga gtctcocccc tgtcacccca gctggaatgt 720 

ggtggtgtaa acagctcact gcagccttga ccctgaggct caagcgaccc ttctgccttg . 7 B0 

gcctcctgag tagctgggat cccaggcgag agtcaccaca ccctgtccat gttcctgcag -840 

gtcttgatat gcgaggacgc tgcgtcntcc ctgccacact, ttctrcttct ttcttgagac 900 

a'gacccttgc tccatcaccc agg'ccagagt* gtggtsgtgc gaacacggct cactgcagcc 960 

tcgaccctca ggctcaagcg atcctcacgc cteggacccc caaagtgctg ggatcacagg 1020 

cgagagtcac catgctggcc cgaatcctca gggtatttca cggttgaagt gtcactcact 1080 

tarccacscc* cgttccaaga gtgtaggtgg tcaccctgtc tccgycgctg acctggcctg 1140 

gaccctcggc tgtgaqaggg aggggtgggc tgggctggag gaacctraag ccctcgtgat 1200; 

gtcacaagcc catcrggctg ggcatcccct gctgtgtcct gagctgcaca tgccccaggt 1260 

ggcceccaca gcagaggcga gccactgrag ggtgragggc ttccacggac ggtcttcagg 1320 

ggragaagaa gggcccaggc ccccaggaga ctcaggagac cagagcctgg ggtcaggggc 1380 

cyagcagggg ctyarccagg gctggatgtc cggagccagc cccgmagccc tgkgJctcttt ' 1440 

gttcttcgca ctcccaccgt ccgtgtgaac agctccagcc ccacctgcgc ctccctgtgc ■ 1500 

tgggctccat cagggagccc agaagacgtg tgtgcttctg aaattgggtc cctacatgcc 1560 

tttgtcccag tgcaccttgc tccttccatt tactatcgag atttaaacgc ctgttttctc . 1620 

cccagaggtt gacggatata ttcagacgtt acgacacgga tcaggacggc cggattcagg 1680 

tgtcgtacga acagtacccg tccatggtct tcagtatcgt acgaccctgg cctctcgtga 1740 

agagcagcac aacatggaaa gagccaaaat gtcacagtcc ctatctgtga gggaacggag. 180G 

cacaggtgca gtt-.agatgct gttcttcctt tagattrtgt cacgtgggga cccagc-gta .^860 ■ 

catatgtgga taagctgcttt aatggttttg caactgtaat agtogc-gta tcg~tc-aat ' ,1920 

gcagacottg gatttgg^ga ccgtcccacc gtgccatgag gtaaacgtaa ngtctcaggc '1980 ' 

actctgctt:g._*caaaaaaatc tatcatgcgc ttttctagat gtctc-ggyt ctatagcgca 2040 , 

aatgctttta ttagccaata ggaattttaa aataacatgg aacttacaca aaaggctttt 2100 

catgtgccct acrtttt taa aaaggagctt attgtattca ttggaatatg tgacgtaagc 2160 * 

aataaaggga atgttagacg tgtaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220 - 

aaa&aaa 2227 



<210> 31' 
<211> 1288 , 
<212> DNA 

<213> Homo sapiens 



<400> 31 

cccccgggct gcaggaattc ggcacgagcc tgacctcctc agcctcatct ctccctcctc 60 
tgctctcgcc ctccgtgctc cagccaacgc ggcctgccac tcgtccacct gccatactgt ' 120> 

cctgactcca ggcctttgcc tgtgctatgg cctccgctqq .qaccac^ctt 'gtctctcccc 180 
tgctgtgtcc gctaattccc actcgtgtca gtgatccatg gctgcagaac acaccactcc " 240 

atccatggaa 'aacaatcaca atcactgatt actatctctc cctgggcttc ctggggtgga 300 

ctgggcccag ctgggtggct cactttgggg cfLcdgcagL t_-ciLyqgcaqa cagtggccgg 360 
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ggtcactgca aagact-ccc tgcatctctg gcagttgatg ^ctggctgcca tc-gagacac 420 

ccacccaggg cccctccctg gggcctgggc tcctgc-tag cttggttggg aggctccaag . 4 SO 

accaacatcc caagaaagat gagacagaag ccagatcacc tttttgggcc tggct-caga > 540 

agtcacccag caccacttct gctgcattta tttcttaaaa cacaaatatc aaaccccatc 600 

tcttgatggg "agggggcctc atggtttgta aacatgttct aaactccact ctgcccggcc 660 

ttggc^caac gtgtctggta atgtgtgggc cgcgaggctc cccgaacgta gacctcagac 720- 
cgcaacgctg gccgttacac gctctggcac acgggcccac gccaggccca tcgccacagt - 780 

gatggttgtt ctgigactgt ttctggtggc ctctgctcca pactccaggc tgacgctgtg 840 

ccccttcrac tgggaccctc gggtggcttc catgcacttg tgceczaaat cc:gctccte . 900 

gactaaactt catctcccgt gttctcatcc tgcagcatgg ctgttaggga acctgaccat 960 
ctgcagcgcg Lctcgccgcc aaggtataat gtcagtgcct cccttcagtg gcwcccatgt ' 1020 

cacagaattg tcctgcagcc ctggcacatg tgtgccatgt gggagctggg gcaggtcctc 1080 
cttcatcctg tggctccgag ggagggggcc gctccttcc^ cagcctceac cctgacctgg ■ 1140 

ccctcgtcct. gcagccactc agagagcacg atggagctgg' agctxcagtt ctgaccaaat .-' 1200 

gcgtgtcgcc ggcttttg^g tgtgtgtgLg Lytgacagag ccagaccctg tctttaaaaa 1260 

aaaaaaaaaa aaacccgagg gggggccc. ' . 1288 



<210> 32 
<211> 3280 
<212> DNA 

<213> Homo sapiens 



<400> 32 " ■■ ' ' 

gcgtccggtg agaggcaagg act tt teat c cttagggcat tctgaccacg tcctgcttca 60 

gagagattgc ccccggcgtc teagtgetat ggggagcagg ttcctcctgg tcctgctctc * 120 

aggtctcact grcttactgg ctcrgccagg atcaqaacjcc aagaattctg gage t tec eg 180. 

tcctccatgc cctaaatatg ccagctgcca caacagcacc cactgcactt gtgaagatgg 240 

czttcgggcc aggtctggca ggacatactt ccatgattcc cctgagaagt gtgaagatat. 30Q 

taatgaatgt gaaaceggge tggcaaagtg caagtataaa gcatattgta. ggaataaagt 360 

eggaggrtae atctgtagct gtttggtaaa atatacttta ttcaactttc tggctggtat 420 

tatagattat gatcatcegg attgttatga gaacaatagc caagggacga cacagtcaaa 480 

cgtggatact tgggtgagtg gggtgaagcc tggatttggg aaacagctgg tacgtacaac . S40 

tacgccatct ccctacccaa acattaacat gtcttcctg t gattcttagg gtagggtagt* 600 

tctatccagg ggtaattttg ucctctgtcc caaggtcatc cgtcaacgac tggggacact 660 

mtggttgrc ataccteggg ggtgatgtgt gtgactggca tctggtggat ggagaccagg 720 

gatacagctc aacacccuacagtgcccagg acagcctccc acaatcaaga agr.gcccagt 780 

gccatatgtc catagagata gagaaataca agtgtagggg gaaagtgcct cagctggca- 84 0 

ncaaagaccc acarcaagca cctagattct caatgccaca cgcaccttgt agcaccztaa 900 

taaaxatcgt tgccttctgg gctccccact ccaacacttg' tgeatattec ctatttctta * 960' 

ca'tttcagta agagtcaatc taattagctt aatttttttg gaaggaaaat ctgagaagaa 1020 

atggaagcag agaggacttt gcaagaaggg ctactcaact aacccaaagc gtggagttga 1080* 

gcacctggaa tgcgagtttt gcttctccag gaaagggtca aatttctgaa tctgatatag 1140 

tctatgaaac caagaggtgc aatgagacaa gggagaatgc tttrctggaa gctggaaata 1200 

acaccatgga tatcaactgt' getgatgett taaaaggaaa cctaagagag ageactgeag 1260 

Ltgccctatc acttatcaat ctcttgggga tattctgaat g'catcctttc ttagtaaacg ' 1320 

aaaagggatg caggaagcaa aactgaactc ttacgtcgtg agcggcacca ccggtttgaa 1380 

gqaaaaaatt tccctctcr.g aacctgtgtt cctgactttt cgccazaatc 'agcctggtga - 1440. 

caagagaaca aaacatatct g'tgtctactg ggagggotca gagggaggee . gctggtccac 1500 

ggagggctgc tcccatgtgc acagcaaegg ttcttacacc aaaegcaage gcttccacct t 1560 

. gtccagcttt gccgtcctcg tggctcttgc ccccaaggag gaccctgtgc ugaccgtgat . ' 1620 

cacccaggtg gggctgacca Cccctctgct gtgcctcrtc rr.ggccatcc tcaccttcct 1680 

cctgtgccgg cccatccaga acaccagcac ccccctccat ccagagctct cicctctgcct '■ " 1740 

ctccctggcc cacctcctgt tectgaeggg catcaacaga actgagcccg aggtgctgcg '1800 

ctccatcatt gcagggcugc cgcaccccct ctacctggct tgcttcacct ggatgetect 1860 

ggaagggctg cacctcttcc tcaccgtcag gaacctcaag gtggccaact acaccagcac 192 0 

gggcagattc aagaagaggt tcatgtaccc tgcaggccac gggatcccag c-gtgatcat 1980 ' 

Cgctgtgtca gcaacagttg gaccccagaa tratggaaca tttactcact gttggctcaa ■< 2040 

gctcgataaa ggattcatict ggagcttcat ggggccagta ytjayLcaLta tcccgataaa ' 2100 
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cccggtgttc cacttccaag ttctgtggat cttgagaagc aaactttcct ccctcaataa ' 2160 

agaagtttcc accaitcagg acaccagagc cacgacattt aaagccattt ctcagctatc 2220 

tatcctgggc tgttcttggg gccttggttc ttttacggtt gaagaagtag ggaagacgat . . 2280 

, tggatcaatc actgcatact * rua ttcacca t catcaacacf: cttcagggag tgttgctctr 2340 

tgtggtacac tgtctcctta accgccaggt tcgaacggaa tataaaaagt ' ggtttagcgg 2400^ 

gatgcggaaa ggggtagaaa ctgaaagcac tgagargtct cgctctacta cccaaaccaa 2460 

aacggaagaa gtggggaagt cctcagaaat ctttcataaa ggaggcactg catcatcatc 2520 

tgcagagtca accaagcaac cgcagccaca ggttcatccc gtctctgctg cttgcctaaa 2580 

gacgaacrga cctggcaagt gccatggcaa cgacccggaa gttacccctc ccttccgttt 2640 

gtctacagcg cccctgcggt cacacataga ctggacaaat gccactattc ctagccttcc 2700 

tgtgaaaagr ccaggcccat tcacctactt tggcttttta tgttcacaga aagaacaaga 2760 

catttgggag aattcttaga cccagagccc " agtagtgtgg. cacgtgcaat gaagtgtcgg 2820 

aaggatgcat tttaaagatg gcgggcggga. ' gaagtggat t ttcttcttgc agcractgcc ,2880 

accttgccag aaactcacta actggcatct ggattcagct catagttccc tttctggcct 2940 

ccccgccgca ttttatgccc ccaaagacct tacattaaca ccccacaccc acataattca 3O00 

acaattttca tatggatcag tattaaagag ggtgttgcat tttgcaatac aaaaatgcat ■ t 306Q 

-atcaggtgc tggagaggat Qcggagaaat aggaacactt ttacactgrt ggtgggactg 3120 

Laaactagtc caaccattgt ggaagtcagt gtggcgattc ' ctcagggatc ' taggactagg ' 3180 

aataeeattt gacacagcta tcccattact gggtatatac cccaagggct ataaaccaLy * .3240 

ctgctitaaa gacacatgca cacgcaaaaa aaaaaaaaaa • .32B0 



<210> 33 • / 1 ■'. 
<211> 1297 • .' . , ' 

<212> DNA • . ' • r . ; ■. ' : . 
<213> Homo sapiens * 

' <400> 33 . . . - '■ ' 

■ ccacgcgtcc ggactgcttt acggacattg gatgaagccg aagcattcag aatggcgcct * , 60 

ggcacacagt tggtgcgtga ^catggtcaag, ct ttgtgtcc ccacccacat cccatcttga 120 

atgtgacggt ttecccggct "ccctcccgcc gccatgtgaa gaaggtcgtt gcctcccctit 180 

cacctcccac caccatgatt gccatggatg ctccccactc caaagcagcc / c t ggacagc a 240 

ttaacgagct gcccgagaac atcctgctgg* agctgttcac gcacgtgccc gcccgccagc 300 

tgctgctgaa ctgccgcctg gtctgcagcc tctggcggga cctcatcgac ctcatgaccc 360 
tcLggaaacg caagtgcctg cgagagggct' tcatcaccaa ggactgggac cagcccgtgg . 420 

ccgactggaa aatcttctat ttcctacgga gcctgcatag gaacctcctg cgcaacccgt . 480 *' 

gtgccgaaga gganatgttt ■ gcatggcaaa ttgatttcaa tggtggggac cgctggaagg 540' 

tggagagcct ccctggagcc cacgggacag attttcctga ccccaaagtc aagaagtatt 600 

ttgccacatc ctacgaaatg* tgccccaagt cccagccggt ggaccttgta gccgaggguL 660, 

actgggagga gctactagac acattccggc cggacatcgt ggttaaggac tggtttgctg ' 720 ' 

ccagagccga ctgtggctgc acctaccaac tcaaagtgca gctggcctcg gctgactact 780 

^tcgtgttggc ctccttcgag cccccacctg tgaccatcca acagtggaac aatgccacat 840 1 

ggacagaggt ctcctacacc ctctcagact acccccgggg tgtccgctac atcctcttcc 900 

agcatggggg cagggacacc cagtactggg caggczggta tgggccccga g^caccaaca 960 

gcagcattgt cgccagcccc aagacgacca ggaaccaggc cccctccgag gctcagcctg 1020 

ggcagaagca tggacaggag gaggctgccc aatcgcccta ccgagctgtt gtccagattt 1080 

. tctgacagct gcccatcctg tgtcfcgggtc agccagaggt tcctccaggo aggagctgag '1140 

catggggtgg gca'gtgaggt ccctgtacca gcgactcccg ccccggttca accctaccag . 1200 
cttgtggzaa cttac-gtca catagctctg acgtt ttg tt gtaataaatg ttttcaggcc \ 1260 

gggcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 1297 

<210> 34 . ' •• ' • 

<211> 2184 ' . , 

<212> DNA ■ ' ' ' 

<213> Homo sapiens ~ * 

<400> 34 ' 1 ' •• ' • 

ggcacgayyc gcccgaggat gtgczgctgg ccgccgctcc cgccgcgggg gctgcccccc 60 
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gggacggcgg cggggggccc gggccgaacc tatccgcacc ggacccccc- ggacccggag 'l20' 

ggcaagtact ggctgggctg gagccagcgg ggcagccaga tcgccttccg cctccaggtg 180 

cgcaczgcag gcnacgtggg cttcggcct= tcgcccaccg gggccacggc ' gtccgccgac 240 

fltcgtcgtgg gcggggxggc ccacgggcgg ccccacctcc aggattattt taca&atgca . 300 

aatagagagc cgaaaaaaga cgctcragcdd gattaccatc tagaatatgc ca'tggaaaat 360 

agcacacaca -caataattga attraccaga gagctgcata, catgegacat aaatgacaag • 420 

agtataacgg atagcactgt gagagtgatc tgggcctacc accatgaaga tgcaggagaa 480 

cctggtccca agtaccar.ga ctccaatagg .ggcaccaaga- gtttgcggtt at-gaatcct 540 

gagaaaacta gtgtgctatc tacagcctta ccatactttg atctggtaaa tcaggacgtc 600 

cccatcccaa acaaagatac adcatatcgg tgccaaatgt ttaagactcc tgcgttccaa 660 

gaaaagcatc argcaacaaa ggttgagcca gtgatacaga gaggccacga gagcctggtg \ 720 

caccacatcc tgctctatca gtgcagcaac aactttaacg acagcgctct ggagtccggc 780 

cacgagtgct atcaccccaa cacgcccgat gcattcctca cctgtgaaac tgtgattttt '. 340 

gcccgggcta ttggtggaga gggcttttct tacccacctc atgttggatt atcccttggc 900 
actccattag atccgcatld Lycgctccta gaagtccatc acgataatcc cacttatgag . 960 

gaaggcttaa cagataatcc cggacrgagg ttactttaca caatggatat aaggaaatat 1020 

gatgctgggg r.gattgaggc tggcctctgg grgagcctct tccaiaccat ccctccaggg 1080 

atgcctgagt tccagcctga gggtcactgc acttcggagt gcctggaaga gctctggaag ' 1140 

ccgaaaagcc aagtggaatt catgtgtttg ctgttcttct ccatgctcac ctyyctggca .1200 

gagcatcagg ctgcgtca.t ttcgaaaagc gaaggaaatg aaantacttg cctatgatga ,1260 
cgatcc-gac ttcaatttcc aggagtttca gtatctaaag gaagaacaaa caatcttacc t 1320 
aggagataac ctaat tactg agtgtcgcta caacacgaaa gatagagcr.g agacgacttg * 1380 

gggaggacta agcaccagga gtgaaatgtg tctctcatac cttctttatt acccaagaat 1440 

taatcttact cgatgtgcaa gtattccaga cattatggaa caacttcagt ccattggggt 1500 

taaggagatc tacagaccag tcacgacctg gcctttcatt atcaaaagtc ccaagcaata 1560 

taaaaacctt cctttcatgg atgcratgaa taagtttaaa tggactaaaa aggaaggtcc 1620 

c^iccctcaac aagctggtcc tcagcctgcc agtgaatgcg agatgttcca agaragacaa 1680 

tgctgagtgg tcgattccaa ggaangaeag cattaccccc agatatagaa agaccctata 1740 

aagccagaac ectttggtgt gtggcacgtc ttcttccctc ttcccctgcc acagagatnt 1800 

ctcccatcca acttgcttgt ttgccttctg c~aactcagc tgcacgctga gcaccaagag 1860 

cttgtgatca . aaattctgt t ggacctgaca atgttttcta tgatctgaac ctgtcatttg 1920 

aagtacaggt taaagactgt gtccactttg ,ggcatgaaga gtgtggagac ttttcttccc * 1980 

cattttccct ccctccttrt tcctttccat gttacatgag agacaccaat caggttctct 2040 

cccccttctc agaaatarct gatgttatat. etacotggtc aacaaaataa aactggcctg 2100 

'acttaagaca accatttcaa aaaattgggc cgtcatgtgg gaataaaaga attctttctt 2160 

ccccaaaaaa aaaaaaaaaa aaaa 2184 



<210> 35 
<211> 949 
<212> DMA 

<213> Homo sapiens 



<400> 35 

ggcacgagct cccattgacc tcatLgcLca gagcgcagca ttagggcaag gtttcgccac SO 

tgcctccctc catgaatgta tccccccctc ctgccctggg gacatgggga . gtggcccgtt 120 

tctttcccca ' cctagt-ccca gaaagarggz gcttggtttt ctgttgttgg attttttttt ■ 180 

tCttttttl-.f. httrtgcacc aaagtggcaa ccaggrcagt gttgggggat caagctggcc 240 

tcggggtggg gggcccccac ctgcctctcc ctggttccca cagtgttagc gtccctgaaa - 300 

agacaacatt ccccctaaag caataagggg tgacgggccg gggggagtgt ttgctgctgc 360 

cggcccccag ctcccccccc ctgccaggtg tgggggagac tcctgttgtg actgaatgta ,420 

acccccccac ccctgccgca gccaatgcag gggaaggggg acactcttcc . tgtctcttct 480 

ccccagctaa agagactttg gacttagggg gcccatgagc ctggagaggc . ct taaccccg 540 

cgaggaagto tagggggagc cctctcccac c;t;ccatcc;cc ttctgagagc ggccaacgtc 600 
tacaagcccc cgagcccccc tgcccaggga ctcagacccc gttgccgtcc ctccccggcc . 660 ' 

ccggtcttcc , tgggccctcg ctgctcctcc qccctttctg gggttggggt gggtgca'ggg 720 

gt^accgtgt tccctgtctg ccctgtaccc acagtctccc cgccccctct ccaccctgtg 780 

tgactcccct ctcttttacc tgctcccgta aatactccrt tctcccaata aaact~ggtg 840 * 

tgtgttcccc aaaaaaaaaa aaaaaadddd daaaaaaaaa aaaaaaaaaa aaaaaaaaaa 900 
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aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 949 

<210> 3 6 ; . . • 

<21l> 3338 " ' ■ . ' , , 

<212> DNA t ' • . ; - ■ 

<213> Koir.o sapiens . ' ' ■ . , 1 ' 

<220> 

<221> SITE . ' " 

<222> (2861) ' 7 

<223> n equals a,-.g. or c ' ; ' 

<220> , . ■■' ' •'. • . • 

<221> .SITE 

<222> (3328.) t •' ' ' . • 

<223> n equals a, t , g . or c , ' , 

<220> • '• ' , . ' ' . -' ':: • '' ■ i 

<221> SITE ' 
<222> (3330) : r 

<223> n equals a,t,g, or c ' 1 



*<400> 36 

Caccgggaca ttctggagtc cgagaagtca a'cggcgcggt tgctgcggcc gccgcgctcc '60 

ccggcccgag gcgatggaga acggagcggt „ gtacagcccc actacggagg aggacccggg 120 

ccccgccaga ggcccccgga gcggcctcgc tgcctacttz ttcatgggcc ggctcccatt 180 

gctccggcgc gtccccaagg gcttgcagc t gt tgctgtcc, ctgccggccn ncatctgtga 240 

agaagttgta tcacaatgta ctttatgtgg aggactttat ttttttgagt t-gtaagctg 300 

cagtgccttt cttctgagtc cccttatact gatcgtgtat tgcactccat tttatgagag 3 60 

agttgatacc acaaaagtaa aatcatcgga tt tttatatt' actttgggaa caggatgtgt 4 20 

gtttttgttg gcatccatca tttttgtttc cacacatgac aggacttcag ctgagattgc 480 

tgcaattgtg tttggattta tagcaagttc tatgrtccta cttgacccta ccactatgct 540 

gcatgaaaaa cgacaggagc cccagctgag aaaacctgaa aataccacta gggctgaagc 600 

cctcactgag ccacttaatg cctaaagact ctggggagca gatgttaccc. aaggtagr.ga 660 , 

ccrctgcattg tggtgcctga gccccggcag aagctcttgt aaaatctgtt aattgcttaa 720 
accacttct^ trggagagca aggggaaggt caagaaggca gtcttatcaa tatcgtgtca ( 780 

gtcaccacaa agtaggccag ataagtcaaa - aaaaatt ttt tttcaaataa taatcgaaac 840 

ttatctcaaa tggagatttc ggtgggagga ggagaaaaca attgttttta aatcacacag -900 

ctcaacggtt gataaatgat tctgtcattc tgttacaggt cattctttta' ctaggcttag 960 

c-tccaaact atgctttata gcrgtataaa catcgtgatt' atactcatct acttagaaat 1020 

tgttttattt ttaaattaat ttgctcagct gtttgtttcg. atgcttagat tatgttctgt' 1080 

taatgggaat tcaacatatt caagaaacca ata~ttaaaa tgttggccta ggtttttttc. 1140 

cccaacacac. actaccaggc tttactgtat ttcactcagc cctaaacgcc anaatatttt 1200 

tggataacgg ttatcaacec tttgagacct tcgtatagcc cataaaatgr. acgggagatg 1260 

ttggtatctt atgr.gtataa aagcaacaat accagcaact tcgtgtctat actgcacctt 1320 

ggttgctgat gtcaagtaaa aaaaagattg ttttgtaaca cataaaaaaa tggaagaaac 1380 

tgataccaca ucLaaggacc aaagacaaga aagacttttt gcccaagaca gtgaaagtaa 1440 

ttacaaaaac aagctttgac cacttaccaa gtatc-gaay agatqagttc ataccatgat 1500 

ttagaaagtg gztcaattcc cctgttggca tatgattatr. T.tcactaaaa ttaatacagc 1560 
^tctgtgggtc ttccttagtg t^ttcttrga agccaa'tctg ttttttttag gacaccagcc t 1620 

.rttggttttt catctgttcg agatgccLcL Lctctgcctc cttatcaigac agaaatggag 1680 
tcatgtgctg ctgcttcatc tagcagaggt tggcctctgg" ccctgacact rtctgtcag- , 1740 

tgtctt-agg tggtcctgaa ; 'tcttgggccc Ctttgattgfc gaacaccgts lagcaggatc 1800 

. ttgagagtcc ttgttct-ac ataggcattg ctctagtttg tctccggcaa aaaaaaaaaa i860 

aaaaaaaagt aaatatccag ggaaccctgc ccagactaac actgttggtg gcacaagaga 1920 

atcaagccat tctcaagaga Caacttcaca accagaattg tccgttggct agcagc-gcc • 1980 

acagacaggc agggcacttg ggatatgacc tttctgtcca ggtgacccac agactagacc 2040 

cttcttatcc tcctcccaga gccttgactc gggactctag tgctaagatg atgagcccgr. 2100 
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gcatcaggtc cttctgcaet ttggtggaag tctcccaggg taggtttcct . at ttgaaaca 2160 ' 
gtggaaccat gtttccagtg ataaagttta" acgacctca: cctttt'-ttt ttttctcatc • ■ 2220 
tgccatttgt gtgtzttaga tgggttttaa ttgcatgaat .g.tggctaatg cggttctcag ' '2280 

aaatcggtca gtatggcccr i acatagcttc. tgctctgtct taccgactca atacctttag 2340 

gatttgtatc agagtttgga tactagtgtt agtggtggtg tcaccactac ttaattggga 2400 

gataatgaaa ccaarcargg atgctgtttt tattgggcat gtcatctaag agaggagaaa 2460 

tagctgggct ttgggtctaa ttatgaataa ggactgattc agaaaacgag tttatggtag 2520 

gtagectaaa gttL'cacatc agaccgtacc attccgattt agacctatct aaaattcaga 2580 

gcatatcatc tgggctacct. cagggtcacc. acccatgtat tgggcttagt caggattgac 2640 

agatacattc tcagctggcc tgtcacataa' aacatactgt cat. -gage t't aagctccgct 2700 

tgttctgagg tttcacctcc 'acgtgtttca ttggtgcaaa agtggatctc ttagttggtc 2760 

acttaattct ttctttttca ga'aagatagt atgttcacty gtatatttgg 'ccactcttag 2820 

aaccttcctt cacattgttt tccatgggac ccatgaatgg nttagect-t cttttctatt 2880 

gtagaaggaa ataaatagga gtaaaaagac cattgtagta aataagttca aggggaactt , 2940 

gggaccagaa accactgtta tgtacaaaaa aatggcaaat tca'ataaact caaatttaaa 3000 

ataattctta aattaacagt tatgataaat tttatatttt atacaaatag attgettaga 3060 

atggttctca agaattataa gagaaatgaa ctcacagtac aaaaatttta taattactat 3120 

acctgtgttt tgtttggggg .ctgggaaatg tatttttaca ttgtagccaa tcattttata 3180 

tttgtcaatt taaatcttat gggtcttttt ttttttatct ctcttgatgt cagattttat 3240 
agtcttttta aataaaccca tttaaf.taaa acgttaaaaa aaaaaaaaaa aaaaaaaaao ' '3300 
gggcggccgc 'tegegaatet agaactangn cccacgcg 



<210> 37 
<211> 1563 
<212> ON A 

<213> Homo sapiens.- 



<400> 37 ' 

eggcacgagg agaaaaggac tcagaccttc cagattggct ccagaaggca gctgetgtca 60 

gtgatttr.ee tcacagacca ttgcagccac cta.ccttgcg ccetggtccc gagacagcat 120 

ggcgtcttcc cccacacgga ectgeaggtg gectcttagg agctttctcg ctgaaggagg 180 

tcatccctct tagtcaca-c cgagcagcac tcacttaggc .ctcggccccg agggtcatgc 240 

gggggcacct ggccgggttc ccggctctgt ctggtctggc atctgtctgc, ctttgggcta 300 

ccttttcggc acagctccct gggcctgtgg cr.gccacaag ctgga.ctcca.gctcccctgg 360 

gctgcagtgc tgctcgcagt ggtccagaaa agaggctggg cacagcggct cctg'gctcag 420 

cagcaccctt ggegcagget ggeccaggag ctccgtgcag ggtgctgcca gttgacccag 480 

ccccggctgc actcaatgtc egagaacegg gctggctggg gggectett: gaeggagett 540 

tactccaagt ettgetgaat ttcctgagga aaagcactga tgttctcatg gacacaaggg * 600 

aggcagaatc tcttgaagta gaataaactg cactcctttc cagtcttcct tcctcagctt 660 ; 

ctcctggacc gccagctcct ccatgcgcct 'cagacactcc caacgccgca ctgcgggggt 720 

tcctcccgcc c.tggcccatc -tcatccccct ■ tggctgttga tacagctccc' ctgtgtgcat 780 

gtcgccctgt ggcagatgee cagggacttc tcagactcca ggatcactcc gagtacactg 840 

accacacagc ctgctgcaca aactgeageg ccagctaagg accaggaatc tgacattgtc . 900 

ggaggtgaag ggatcctttg tgacatagct ttcctccagg aggaccatcc actgggtgtc 960 

gggggagrtt ccgccccaag tagcaggagg gagcegtcaa ggcgcggagt Lcacacccaa 1020 

acactcccag aggaeggcac. tctgcatggg acgccatcca gctcctttga ctgtggaatc 1080 

aagtacatca -tcagccggcc cctggctcct ggctgtgacc tccc'ttcgct tgaactgagc 1140 

cttgtgtgta agggegtate aagttgtatg ggcctcgccg cfcgggtgacc ecaaatgeaa 1200 ■ 

ctccaggaag catgggttac caatgactaa ggttaacgat cagccgggcg cggcgctgac 1260 

gectgtaate ccagcacttt gggaggcega ggcgaatgga tcacgaggec aggagttcaa 1320 

gaccagcctg gccaacacgg tgaaaccccc atctctacta aaaatacaaa aattagcegg 1380 

gcgcggcggc gggcgcctgt agtcccagct actcaagagg ctgaggcagg agaatcgegz '1440 

gaactcggga ggegcagett gcagtgagct gagatctege ccgctgcact ccagcctggg 1S00 

eggcagageg agactccgtc ccaaaaaaaa aaaaaaaaaa aaaaaaaact cgaggggggg 1560 

1563 



<210> 38 
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<211> 1048 
<2i2> DNA 

<213> Homo sapiens 



<400> 38 v •' 

cccgggtcga cccacgcgtr cgacaaaaaa caagg-atgt - gtgtgccttg ggatgctttt 1 60 

ccgggcttac cccatctgtg tcctcataga ttctccgccc" -gccagccat gtctttggtc 120 

cacaggagcc acUcccact ccaactctcc cacaacctcc ccactgttta ctctcttcat 180 

gccccgcgct czzgccccaa atcccstcac ccaattggga aaactggatg ataqacaagt 240 

agctccrcac tcacatgcac ttctctggaa atiaaagttg ctttccagct agtatttcct 300 

cacgtcggaa gcacgtgtaa j aataacacat ggaaaczatt tccacctggc acacaagaaa 360 

aaaagggtct gcctatattg taaccgactt ctacactttg catgttttac Lctgacccac 420 

aatntttttt tttggatagt 5 tgaagcaaaa. aacgtggtta caagatgact aaagtttggc 480 
ctacaatctc ctgtttggac aacaggaaga attgtgcagt gttttgcaac aqgactaatt , 540 

ctagattCLc actgccctta aagataacta gagggaaagg gtcttctctt > tccctttcat ' 600 ' 

ttattaaaga tacttacagg ttcttcagaa gaagtgatgg gtcct-tagg toatgaaata 660 

ggtgttgatg. gtgttatggg tgatgatgta actggtggcg- gctgtataaa gtcttcatcc 720 

ttaaacatca gaactgagag t-gggttact gatttaaaag aaaatgtgta ttaatctaca ■ 760 

gtccactacc ttgactacca ggaacntgtc acaactacct catatcctgg cacaggattc * 840 

aaaaggcaaa atctggaacc taattacgrg V.agccaaata .ttaagcttcc tcttagatca * 900 

aacagtacaa taaatattcc ttattaagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 960 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ,1020. 
aaaaaaaaaa aaaaaaaagg " gcggccgc *' - ' 1048 



<210> 39 

<211:> 1430 

<212> DNA 

<213> Homo sapiens 



<400> 39 

cggcacgagg agcagctgag tccctcccct gtctttcact cttctggcat cggtggtttc - 60 

actccttcga ttgaaccccg cttcctcgac ccccctggga ggccgccttc ttcaggcgcc 120 

ccccctctcc ccacgagctc gctctgacag ctgaggaact ggcaagaccc tgctacccag 180 

agggtgaatg ggtatctttc ccggaataat cc taatf.tr.r. ctaagggtga agtttgcaac 240 

ggcggccgtg attgtaagcg gacacc&gaa aagtaccacc gtaagtcatg agatgtctgg 300 

tctgaattgg aaaccctttg tacatggcgg ccttgcctcz atcgtggctg agtctgggac 360 

. tttccctgtg gaccttacca aaacacgact tcaggt-caa ggccaaagca ttgatgcccg . 420 

tntcaaagag ataaaatara gagggatgtt ccatgcgctg ttccgcatcc gtaaagagga 480 

aggtgtattg gctctctacc caggaattgc tcctgcgtt.g ctaagacaag caccatatgg 540 

caccattaaa attgggattt accaaagctt gaagcgctta' ttcgtagaac gtttagaaga 600 

tgaaactctt ttaattaata tgatctgtgg ggtagtgtca ggagtgatat cccccaccat 660. 

agccaatccc, accgacgttc taaagattcg aacgcaggct caaggaagct tgttccaagg 720 

gagcacgatt ggaagcttta tcgatatata ccaacaagaa ggcaccaggg gtctgtggag 780 

ggtaagtact cttttcctgc tattatccta cactctcagt tcctacaatc tgcagagaat 840 

tcttttr.r.at ataaagacac aaaatcgtga attataatcc aaaaactaag gtoagaaact - 900 

cctcatctcc cttgaaaggc ccaaaactta tcattggcct rtcattcccg cataatgttc 960 

ggggattata taggtgggga aagtcattac attattcgag atggctgttt cgatcatatt 1020 

cacagtgaac gtagttgttc agtgtacttt tttgcaagtt ctgtactaac, acgatgatgt 1080 

a^gtctttgt agtgcttatg ctcaaaagct gttaccggct gtgtgctgtg gcccatgccc 1140 

g-agtcccag cactttggga ggccaacgcg ggtggatcac ttgaggtcag aagtmcaaga 1200 

ccagcctggc caaccLgycg aaaccccatc tcaaccagaa atacaaaaat tagccaggca 1260 

cggtggtgca tggctgtggc cacagctact caggaggctg aggcgggaga attgtttgag 1320 

ccctggaggt ggaggttgca gtgagccaag a-catgccac tgcactccag cctgggcgac 1380 ' 

: ggggcgagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa aaaactcgag ' 1430 



<210> 40 
<211> 2103 



WO 00/06698 



<212> DNA . 

<213> Homo sapiens ' 

<220> 

<221> SITE 

<222> (2101) 

<223> n equals a, t,g, 



PCT/US99/17130 
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<22"0> 

<221> SITE 

<222> (2102) 

<223> n equals a,t. 



<400> 40 
ttcctcgtag 
aaagagtggg 
ctttaagggc 

. cgcaggctgc 
' ccccgctcgt 
ggaagcacct 
gagtgtatta 
acccgcttca 
agtggtgatg 
cattgtggac 
cccctacctc 
acctggagga 
gtaatcctac 

. ctctgcagac 
tttccccaag 
cagagaagag 
tgcgcctgct 
tgctcatcga 
ccttctccaa 
.tggtgtcctc 
ggcacgacac 
cagcactucc 
ttggacgctt 
caggcctcac 
tccgggcctt 
ggaagaccca 
atctgctgcc 
gagagggtgg 
gtttgggacc* 
ctgggtgact 

' ctgttctggg 
ggggcaggga 
ggcccagcag 
aaggggatag 
ttgttttttt 

, nna . \- 



cgagcctagt 

gagcaaaggg 

ggggcgtccg 

gaggaaaggc 

cctccccggg 

gactacgaat 

tatcaacact 

agaagcctgc 

ttgatgctcc 

aagaacaagg 

tactcatgca 

gcagctgtac 

ttcctgctgg 

acagagggtc 

gagagtagcc 

gcggaaggct 

ggtgctgacg- 

tgaggctgcc 

gctgggctcc 

agttgtgggc 

tgccatgacg 

tgtcttctct: 

caactggctg 

cacactctgt 

tgggctggac 

gcaccagtga 

taggcctgga 

gtggcagagg 

aggacctcct 

gggLctagcc 

ccatccccat 

aaggactggg 

agcctaagca 

ggaggaggag 

aaaaaaaaaa 



ggcgggtgtt 

aggacagagc 

gacgactgta 

ccctaggctg 

cccagaggca 

gctatccgtg 

tctgtttgca 

tgagttcacc 

tcactctgct 

ccaacagaga 

tctccttc'ct 

tgctcagcct ( 

rrtgcctttag 

ctgctgggta 

cctggttcca 

tcagcctgJcc 

ggcctgtctg 

atgccccgag 

tttggtgccg 

ttctatagct 

cagataattg 

cgaaccctgg 

ggcaatttct 

ctggtgaaga 

agactgccgc 

cctccagctg 

gggaagccca 

ggagcagagc 

gcttttccat 

cctgatccca 

agccatgttt 

ccagggcagc 

ctgtgctatc 

gcttcagcca 

aaaaaaaaaa 



tgcattgaaa 
cctttaaaac 
tctgagcccc 
ggtctgggtg 
cctcggcttc 
cgagaacagc 
acactctaca 
acagggtgtc 
ggtgctaggt 
gtcactctat 
tggggttctg 
z tgaggaggc 
acatggagct 
tgtggcttcg 
ggtgcttgct 
aacggaacct 
tgctcattgt 
gcatgcaggg 
tcattcaggt 
ctccactctt 
ggaactgtgt 
ggctcactcg 
acattgtgtt 
ccttcactgc 
tgcccg-ctc 
ggggtgggaa 
aggctacttg 
catctgcact 
acttaac tgt 
aatctgttta 
acatgatttg 
ctcgggagat 
ctggaggggc 
ccagcaataa 
aaaaaaaaaa 



cgtgagcgcg 
gaggcgggtg 
agactgccgc 
cttggcggcg 
agtcatgctg 
tattccacga 
tcctctgcca 
ctgggccggg 
atggtgtggg 
gacctttggg 
ctgctcctgg 
agccctgacc 
gctacacaga 
tagggcctgg 
gacagcctcc 
gggctacccc 
ggccatccac 
tacctcctta 
tgtactcatc 
ccggagcctg 
ctgtctcctg 
ctttgacctg 
crtczacaac 
agctgtgcgg 
cggt ttcccc 
ggaaaaaact 
gacctcagga 
at tgcataat 
ggcctcogca 
cacatcaatc 
atgtgcaata 
agactgtctc 
tttggaccac 
agttgatccc 
aaaaaaaaaa 



acccgacctt 

gtgcctgcce 

gagtttctgt 

gcggcttcct 

agcagagtat, 

gaggatccgc . 

catcttcctg 

tctmtgagac 

tgg cat cage 

agtactatct 

ccgctggaag- 

egcaggatet 

caggtcctgg 

gatacctggg 

catccctgca 

ctggccatgc 

atccr.ggagc 

ggecaggtet 

ttttacctaa 

cggcccagat 

gtcctaagct - 

ctgggtgact 

geagectttg 

gcagagctga 

caggcatcta 

ggacactgcc 

cctggaatct 

ctgagecaga 

tggggtaggg 

tgcctcactg 

gggtggggta 

cntt.gcctct 

ctgaaagacc , 

agggtttget 

aaaaaaaaaa 



60 
120 
180- 
240 
- 300 
• 360 
,420 
480 
540 
500 
"660 
720 , 
780 ' 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380. 
1440, 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
,;1980 
2040 . 
,2100 , 
2103 



<210> 41 

<211> 2349 

<212> DNA • 

<213> Homo sapiens 



<400> 41 

teg ace;; a eg cgcccgtcta tttctgtgtc tgtatgtatg ggatgtgtct gattgtgtgt 



60 
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ctgtggtgta tttctgtgtic tgtatgtatg gggcgtgtgt gattgtgtgt: gtgtctggtg ^ 120 

tatctctgta 'tgtacggggt gtgtctgtgt gtgactgtgt gtgtgcctgg tgtatctctg *■ 180 

tgtctgtatg tatggggtgt gtctgtgtgt gattgtgtgt ctgtgtttat gtgtgtgtgt , 240 

ctttgtgtga Lt tttt'gtgt ar.ntggcaaa cccagaacag aacattatca ctcacctcac 300 , 

ctggcaaaac aaaaagctt?: cagagagatg tgtggccgcc atgatgrctc agctgcaggc 360 

attttccaaa gttatgt-tg atgcagctgg cacctctctt ctcggggaca aatgtgatct 420 

ttgatcatcg agttgagaat gagtttactc aataatttta aaatgcctcg gtgtgagagg 4B0 

atgggctgct agtaaacatt tgtggcaaat atgcgttgcc ttttcctccc tcaagtgcat 540 
ggggcaggct gcttctctgc acgtccatct g-gggtcagt ctgccctgcc acctc=cttg ■ '" 600 

ccccagggct tagtgccctg tgtggaccac ggctgcccca gggatgctgg ggaaggattc 660 
agcccaacaa atagattcgt cgacgtcctc tcaacgaatg tgtgctttta aaaatgtcrt . :' . .720 

taattttgca ctccatttca tacaacaggt gcacacttaa aacatttatt tttaaacagc. 780 

tctgtcttet cagcactgca gactatttct cgctcacttg acaggcagac gtzgggacta* 840 

tgttcttttc ctgccaagaa tccatcagcc ccccaatgcc cgcacctcga g tccagactc • 900 

cttagcgttg cgttgggacg taattccccg ttagcatccc cagccccttc tccccctgtg 960 

agccagcctg ggttcttttc ttt'tccttt cttccctgtt ttccttcttt cttttttggg i020 

Cttgtaataa gtaatataca gtcttgtggt taagaaatca aaacaataaa gaagatacgg 1080 

cttgaggatt. cttgctctgc actggtcctt cegccaggac ctggtgcgtc ccccaagagg 1140 

gccttgccat gtgcgagctg agacacatgt tcaca'ctttc ctcctttttt gtacaaaagg ' 1200 

tagcacatgg taaatagcgt ctgcacgctt ctcccctcgc- tctgtcttcg ggactactcc 1260 

acatccatga agaagcagca tccttattct cttgcacagt gggcagtggc tacactgtat 1320 

tcactcgtct ctgctgatgg gcactattgt: agtggtgcaa ttagtaacct agtatacatg 1380 

cccttcagcg tgtgtactgg tgtatggatc tcacattccc agaggctgag tgaaagaata 1440 

aaggcttttt gacagzactc cncaggtact cacacactcg ccctgtgagc cacgtgatca 1500 , 

tgggagcggc tccccgcaga ggcgtccga- gggagaggct gatgtccttg tctgatggtt 1560 

gctccaacct gagaatggtg agctgctctc ccaatgcaga ctttcatcac ttgcatctca 1620 

tgcgtgagat tgaactgctt ctcatttacg r.r.r. tea tent rgtttgccca tattttagct 16BO 

gggctgctaa tcttttaatg ttttctaggc gcttttcata aattagggag tttagctgta 1740 • ' 

gtgatgtgag ttgaaagtat tttttcaaat' 'cgtcattgtc ttgactttcc cttccttttt 1800 

tcttttgcca cttaaaatat gcagttgaat ctcagtattt gggggcttct ggattttgag I860 

tcgtagttga aaaggtcttc ctattctgag, gttatacagg aattcaccca tttttttcct 1920 

aatattttac ttccatttaa actttrgacc cacttgigatt gattttgttg tatggtgtca ' 1980 

agtatgggtc caagtttgct tttctgacct ggggatccta acatcgttca tcaaaatctc 2040 

catgttggcc gggcgtggtg gc-caagcct gtaatcccag cactttggga ggctgaggtc, . 2100 

ggtggatcac aaggtcagga gatcgagacc accctggcta acacggcgaa atcctatctc 2160 

tactaaaaat acaacaaatt agccgggtgt . ggtggcaggc acctgtagtc ccagctactc 2220 

aggaggctga ggcaggagaa tcgcttgaac ccgggaagcc gaggetgcag tgagctgaga 22 80 

ttgcgccact gcactccagc ttgggtgaca gagcaagact ctgtctcoca aaaaaaaaaa , 2340 

aaaaagggc 1 2349 



<210> 42 

<211> 1559 

<212> DNA f 

<213> Homo sapiens 



<400> 42 

attcatgcca aaacataggc tttcagtgcc tattacatat ggctttcagc cctctctact 60 

gagggacgta ggagtttatt tcLgaggtcc 'gagcctcttt ccctttaccc cctrtactct 120 

ctcctaagcc ttctttataa aaactatgca tgtcctattg ttctccttcc -gattccctt , 180 

tcttctatta tccccagtag gagtgacttg taactctcat atgtnagaaa ggc:aggtctc 240 

cCggttgaag aaaagatcca cccaagcaag tcagcagttt: aataaacctt tgagggggat 300 

ctcaaatgtg ggaaggattg ttarataaga caaccaaatg atgacatgag acaacaaatg 360 

ctataggaat catggaggaa taattagcta cttactttcc tggttaggga agagatazta .420 

zcagtcgtag aagtaattac taacttccac aLttuttatc gtggaaatca aaaatatata 480 
tatgaaaata aaatgttata accgactcca 1 gtgtcccata aarragcttc caacaattac • . 540 

caaattgtga ccaatccttr. acacacatgc acaggtgicc ctccagtatc cgtiggggcat 600 

tggttctagg accacttatg gataccaaca tctatggatg ctcaagtccc ugatacaaaa .660 .' 

tggtggacta tatgcataca acccgcgcac gtcccgtact aaccaaarcc tccctagatc 720 
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acttacaaca 
aataaccccg 
atatccca'gc 
gtgtgtgtgt 
tgtaaacart 
cttatcctca 
tacttgaatc 
gttcccttta 
.tcctagaatt. 
cacttcctgt 
tagaccatca 
tgttttttag 
"aaatttcca 
aaaaaaaaaa 



cgtaacacaa 
ccaaaaatgt 
ccccaagtcc 
gtgtgtgtgt 
ctctctttaa 
tatacccggt 
tgattcaatt 
ccacttcctt 
tcctgtagtc 
gagctggcag 
actttagatc 
aattacctct 
aattcagatc 
aaaaaaaaaa 



tetaaatgee 

acttctctca 

ccztcaaccc 

gtgtgtgtgt 

aaaaattatt 

qttcagattt 

catgtactgt" 

attaaattcc 

tgaattatgt 

ttagatctag 

atcaacttgg 

taaaattctg 

agcaaaacac 

aaaaaaaaaa 



acgttaacaa 
gtccagacgc 
cctggcacat 
gtgtgcatac 
atcacagcta 
tctagar.tgg 
aatgtttgat 
Ltgtaatttg 
ggtaatcg-t 
aagcttgatt. 
atcatttgtt 
atttaatttt 
aataaaatct 
aaaaaaaaaa 



ccgtcatacz 
cttttttttt 
aggaggctga 
agacacacat 
aacaaattac 
ctcctaattt 
aacttaagta 
ccgtactaaa 
cacatgttcc 
aaattcagat 
teattttget 
ataaccatgt 
attcagagaa 
aaaaaaaaaa 



gtattaagcg 

tgtgtgtgga - 

ctgcgtgtgt ' 

atctctgaaa 

cagtaattct 

ttttacagat 

ccctttatag 

tggattgtct 

ag eg tec tec 

Cttctcr.ctt 

tttgatatgt 

aaaatgttta 

ggcaaaaaaa 

aaaaaaaaa 



780 
840 
900 
960 
.1020 
1080 
1140 ' 
'1200 
1260 
1320 
1380 
1440 
1500 
1559 



<210> 43 ' 

<211> 1766 

< 2 1 2 > DNA 

<213> Homo sapiens 



<400> 43^ 
eggcacgagg 
ttagaaaaat 1 
, t'gtgtctcct 
ttgttctgtg 
aaggatatgg 
gaagaaaaac 
gaagctgeta 
tcaactattg 
gttcatttct 
.tgtgcattca 
tgtgtaagtc 
tctgttttcc 
gtttggacat 
ggggcccaat 
t=gtgatagt 
tcactgtgtc 
catgattgLdt 
ctgtaaagca 
c-tttcttta 
aatacagatg 
gttgtaaaaa 
gcatattatt 
gagagagagc 
" gcagctcatg 
ataaaaagtt 
gaagagtgtc 
aagcagaaag 
atttagagaa ; 
gacatgactg 
aaaaaaaaaa 



agcactgaag 
tgcaagtagt 
tgtagaatat 
tctgaagggc 
ttggatttca 
ctggctgctg 
agcttcctca 
ttttcagtgg 
gtgettcagg 
tttttcctta 
tttcattctt 
agacccaacc 
ttgtccctgc 
gggaggtgtt 
gagttcttg- 
ttgctcctgc 
aacttcctga 
tgcagaactg 
caaaatggaa 
gctctgccat 
toaaaggaaa 
aagaagacaa 
atatgttggc 
aaaaattctg 
tgatgattac 
aaatagaatt 
aagcatgaga 
gaaactcttg 
tgttcatgga 
actcgagggg 



tattcactac 
ggtatattgc 
tctctgatga 
ataaagcatg 
gcaatgagtg 
caggttggaa 
ggaaaattat 
ttctggctga 
aaactaagac 
aaacagcett 
agagttacat 
ctgactgata 
ccaaatttca 
eggatcatgg 
aagatctggt 
tttcaccatg 
ggcctcccta 
ttagecaact 
agaataaagg 
tagtgagaaa 
ttattaatat 
gcataagttt 
tttgctctgt 
catf.gtttat 
■aaaaaaaagg 
Ltcsatctgt 
aagtctacaa 
attaaatact 
acataaagaa 
gggece 



atgaagtata 
aattggcatg 
tactc'actat 
gaaactacat 
ggctttgeat 
cactggcaac 
ttgtttcaat 
aaattggtte 
eatcactggc 
cctaactttt 
'ttctct-ac- : 
tagtctccac 
tgttgaactc 
gggcagatcc 
cattttaaag 
Lgaagtgccc 
gaagecaage 
atacctctrt 
catttcttta 
attgagaege 
acegtctatt 
aacacaatct 
gaaactcaaa 
tatgtgtcag 
gaagcactaa- 
gtataaggat 
tgacatctat 
ttttatgata 
attcctctga 



ttttgcactg 
cactatatga 
tatcccctct 
ttttcagact 
• aaaatttgga 
aatagatacg 
atttctggca 
aattcttcta 
agtttttgtt 
actcccccag 
■ cagtatatcc 
gtgtttcaga 
taatccccag 
ctcacggctt 
tgtttggcac 
gctccagctt 
agatgecate 
tctttacaaa 
tagcaacgea 
tttctctaga 
gtgatcattt 
ggcaatgaat 
tgaaatggtc 
gatcaacctt 
gtaatatagg u 
acttaagcat 
gtcaatataa 
tgaacacaca 
ccaaagagaa 



tggacacaaa 

gcagagtcaa 

ctgetaaget , 

ccattaccag 

agacgaaaga 

gagtttgcaa 

tatgggatca 

tctgagaatt 

9^?ffgatcct 

cccctatggt 

tagtgegget 

tggtgggata ,f 

t.gctgtaggt 

ggtgctgtct 

ttgtaccatt 

cacttttcac . 

accagggttc 

ttaaaaacct 

agaaeggect 

tggcaaaaaa 

actaagttaa 

aaaattgaag 

acctgttcta 

aaattcagtt 

tacagagagg 

tct^taagga 

caagctggat 

cacataatat 

ctggaaaaaa 



60 
120 
190 
240 , 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
, 900 
960 
1020 
1080 
1140 
120'D''- 
L260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1766 



<210> 44 " - 

<211> 2572 

<212> DNA 

<213> Homo sapiens 
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<220> ' ... ' • ' '" '• 

<221> SITE " * ., ' 

<222> (2527) 

<223> n equals a,r,g, -or c 

<400> 44 ' ' - ' 

aattcggcac- gagtctggac. cttcttagmt tgcttgatat ' caggccgtut ttgtagccat ■ 60 * 

tttgcttcac agtcocctyg aacgccggga gcccccgccc atcccgatcc cctccttgta 120 
catgggcgca cttgcgcgct gcaccaccct gtgcctgggc tactacaaga acattcacga 180 
catcatccct gacagaagtg gcccggaqct ggggggagat gcaacaataa gaaagatgce 240 
gagcttctgg tggcccttgg ctctaactct ggccacacag agaatcactc ggcczattgt 300 
caacctcttt gtttcccggg accttggtgg cagttctgca gccacagagg cagtggcgat 360 
tttgacagcc acataccctg tggccacatg ccatacggct ggttgacgga aatccgtgct 420 
gtgtatcctg ■ -ctttcgacaa gaataacccc agcaacaaac tgg'tgagcac gagcaarraca 480 
gtcacggcag cccacatcaa gaagttcacc ttcgtctgca tggctctgtc actcacgctc 540 

tgtttcgcga tgttr.tggac acccaacgtg tctgagaaaa tcttgataga caeca Ccgga . 600 
gtggactttg cctf.gcaga actccgtgtc -gttcctttgc ggatcttctc cttcttccca " 660 
gttccagtca cagtgagggc gcatctcacc gggtggctga cgacactgaa gaaaaccttc 720 
, gtccttgccc ccagctc-gt get gcg gate atcgtcctca tcgccagcct cgtggtccta ' 780 
ccctacc-gg gggtgcacgg tgcgaccctg ggegtrgggee ccctcctggc ' gggctttgtg 840 ' 

ggagaatcca ccatggtcgc. catcgctgcg tgctatgtct aceggaagea gaaaaagaag 900 
atggagaatg agtcggccoc ggagggggaa gactctgcca , tgacagacat gcctccgaca 960 
gaggaggtga cagacatcgt 1 ggaaatgaga gaggagaatg aataaggcac gggacgecat ' ' ' 1020 * 

gggcactgea gggacagtca gtcaggatga cacttcggca tcatctcttc. cetctcccat 1080 

cgtattttgt tcccttttfc- ttgtzttgtt ttggtaatga 'aagaggcert gatttaaagg 1140 

tttcgtgtca attctctagc a-actgggta tgctcacact gaegggggga cccagtgaat ' 1200 

ggtctttact gttgctatgt oaaaacaaac gaaacddLLg acttcatacc cctgcctcac 1260 

gaaaacccaa aagacacagc tgcctcacgg ctgacgttgt gtcctcctcc cctggacaat 1320 

ctccccctgg aaccaaagga ctgcagctgt gccatcgccc ctcggtcacc ctgcacagca 1380 

ggccacagac tctcctgtcc cccttcatcg ctettaagaa ;■ tcaacaggtt aaaactegge 1440 

ttcctttgat ttgRttccca gtcacatggc cgtacaaaga' gatggagece cggtggcctc 1500 
.ttaaatctcc cttccgccac ggagttcgaa accacctact ccacacatgc aggaggeggg ,1560 

tggcacgctg cagcccggag tccccgttca cactgaggaa eggagacctg tgaccacagc 1620 

aggecgacag anggacagaa tctcccgtag aaaggtttgg tttgaaatge cccgggggca 1680 

gcaaactgac atggttgaat gatagcattc cactetgfigr. '.tctcctagat ctgagcaagc ' 1740 

tgtcagttct cacccccacc gtgtatatac atgagctaac tt-ttttaaat tgccacaaaa ' 1800 

gcgcatctcc agattccaga " ccctgccgc* tgacttttcc cgaaggcttg cttttccctc I860 

gcccttcctg aaggtt_*gcat tagagegagt cacatggagc atcctaactt tgcattttag 1920 

tttttacagt gaac tgaagc v tttaagtctc atccagcatc etaatgecag gttgctgtag • 1980 

ggtaact-tt gaagtagata ■■'tattacctgg ttctgc.tatc ettagecata- actctgcggc 1 2040 

acaggtaatt gagaatgtac tacggtac:t ccctcccaca ccatacgata aagcaagaca 2100 

vttttataacg ataccagagt cactatgtgg tcctccctga aacaaegcat tcgaaatcca 2160' 

tgcagtgcag ■tatatttttc fcaagttttgg aaagcaggtz ..ttttccttta aaaaaattat 2220 

agacaeggtt cactaaattg atttagtcag aattcctaga ctgaaagaac ctaaacaaaa 2280 

aaatatttta aagatataaa tatatgetge atatgttatg taatttattt taggctataa 2340 

tacatttcct attr.tcgcat . tttcaataaa atgcctctoa tacaataegg tgaLtgcttg 2400 • 

tgtgctcaac atacctgeag ' ttgaaacgta htgtatcaat gaacactgta ccttattggc 2450 

agcagtttta Luaagcccgc cactcgcatt cgaatgtaag gctcagtaaa tgacagaact 2 520 

atttttrica- tatgggtaac tgggggaata aatggggtca ctgggagtag gg ' 2572 



<210> 45 
<211> 526 
<212> DNA 



<221> SITE 
<222> (66) 
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<223> n equals a.t.g, or c . 

<220> ' ' 

<221> SITE " 
<222> (106) 

<223> n equals a,t,g, or r. 
<220> 

<221> SITE 

<222> (484) ' - a L , 

<223> n equals a,t,g, or c ■ 



<400> 45 ' , , . • 

ctctgacagc ttcctcettg gccaagccct gcctctgtac agcctcgagt ggacagccag. 60 

aggtcnagac tggagcccag agcccaagat ggagccccag ctgggncctg aggctgccgc 120 

cctccgccct ggctggctgg ccctgctgct gtgggtctca gccctgagct gttctttctc 180 

cctgccagct tcttcccctt -cttctctgg.t gccccaagtc 4 agaaccagct acaattttgg 240 

aoggactttc ctcggccttg ataaatgcaa tgcccgcatc gggacatcta tttgcaagaa * -300 

gctctttaaa gaagaaataa 'gatctgacaa ctggctggct tcccaccttg ggactgcctc - 360 

ccgattccct ttgstttcct atccttgcaa attactccar atgattyc'ca aaa-ctggsg - 420 

sccttgtgga ratcttctaa ctggccagca awtwtcaaac, gaaaccccca aacaggaaac 480 

cttntgcccc ctgcacccac ccccaaagaa cttgcacatt yacgtt 526 



<210> 46 
<211> 1032- 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (974) ;.\ 
<223> n equals a,t,g, or c 



<400> 46 • 

gtaaaattgt caaacattta ttttcaaaaa taaaatgtggVtgggdggtta egggtggagt 60 

tattggg^cc tggctgtcgc cactgagcat tgc'tgaatgt tgtcatgact cgcggacttc ■ 120 

ccagagctgc gagcaLgcgg gtgctccccg cggagatttg ctctgtgcct gtaggaaggt 180 

gggtgtgtgg tgtgcattgc ' agcaacattg gtggaacaga tgtgtgtgcc cccatgccgt 240 

catcagggtt cactgcacac gagcctcata cacactacag, aaaatatgct cttgcaaccc. 300 

caagttcatg^ cggaggcatc cacacagatg gcagcagatc aggaaatgct cccagccagt 360 

cctcaggggc agtagggcag ccttcatctg ggtgcgcttg gcagccctga actttacctc 420 ' 

ctcttttcgt tgtatttcct: taacacctta ctcagcattt tagtttcttt agttggatga 480 

gttgtceagt gtctttgtac tgtacagttc tcctgtgctg" ttggaatgga cgtcatgggc 540 

tatgttactc ttttgaaata caaatttctc tttagaagt-.t fctggaaaatc tgttagagga . S0O 

ggagttctga gaatatgaga agacttcttc ccacatagtt gtgcczgccc: gatgtatttt 660 

cccaaaaaaa ttgaaggcga acccgttgga agctcagtcc LgagtCtaac cgactactct 720 

gcagagacat; eiacactctct c tstagcatg , tcccttggaa gaaaataggc tggtgcttgg . 780 

gtggtgatca gaactggcag ttattctagg cctaaaacag tttggcaata aaataagaaa 840 

aaggggccac atgtggtggc tcatgctr.ta atccccacac tttgagaggc ^gaggcaggg 900 

g^atCgcgtg aagccagcct gggcaacata aaatttaaaa a'ttagccaga catggtggca 960 

cgtgcctgtg ggtncagcaa ccLgygtggc taagaccgaa ggatcacttg agcccaggaai 1020 

gtcaaggctg ca . . 1032 



<210> 47 
<211> 2680 
<212> DNA 

<213> HoniO' sapiens 
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<400> 47 

ggcacgagtc ttacagcaag tacttaacaa acctagattt ccataatcag "atttatctca 60 

aaccgcaddd aaaagccgat cccatcacgt taaactgata ggagatcttt atgagagatg 120 

aaagtaagcg atttcaactt tttaattttt ttaattttcg ctctgtctct cacacttgag 180 

gcctttctga aa'ttcaccaa aagagttttg gnagntgtcg ggaatttacc agaacccccc 240 

attattaaaa caattggrtt ctcatattaa aatcatcaca caggttttgt gtacacagac 300 

atttatgcta agccoctgga tagaacLaaa ctgtgtgtag gaacccgtcz tctctaagaa 360 

aggactccct tgagttctgt aggatatgtg cttaaaggtt ttctgtgtca agctatttta 420 

tgaaataaaa -tttacaaac aacgaagcrt cattrtcaca cgttgtattt gtttaatgca 480 

gacttggcct atacgagttt ttgaaattaa gggttttttc cgtaaaacta gcttaggagt 540 

ttaatgagtc actgtgtctc tcacctetg- aggatgagca cagagatcac atttgagaag 600 

caggtgtggg tgttttctgc tggtcacgtc tcctctgccc cccctctggc tcagcagtta 660 

cttttcctca gtcacggggg ctgcattcag ctgcgatgcc agcattaggg ccaagctctc 720 

ccgcttctgt gacgttttat tacacgtact ctttatcagt agtagctgcc atcccctgag 790 

tgttccttcc accgtgeact: gagaagggct gaggattccc gaacatcaac ctcctctaac 840 

cctctgcatg ggaattagat ttttatcatc ctgacrgttg gggctttcta gagctcaggc 900 

aacttgcccc tgggcgggat ggagcctgcc tggggggagc gtgaacgagg cttctgttgt . 960 

ctgtggcttg gccccctgtg gttggaaact tcagaagcag tagattccag gtgcaacatt 1Q20 

tggtgcccgg acaagttgta aaaccrgacc ttggggaggg cctr.caczgg cagccagtgt 1060 

ccacccgtga gcccggtgag cacagacgct agagccaggg ccggcgggct cagaggttgg 1140 

agccggggac cagtgggctc agaggtcgga gccggggacc agtgggctca gaggtcggag 1200 

ccatgtccct gccgcgaagc tctgagggtt ggtggtgaca tcggggcgat tgtggcatcg 1260 

cgactgtggt gtcctggctg cctttggccg gcgctgccac tgccctggtt tcatgacgta 1320 

ggcaccacat ggtcctgtgt tgtctcactt acggctccac ttctggcttt tttctacttg 1380 

ctcacccttg agataaacct ctatttcact ctcarttcgg tttatatgtg' ggtttncttc 1440 

caggtctgat gttaagccta caatattgca atgtgatgtt ttgaagttaa. ygtgtaatag 1500 

agccagtgaa ccaagggttc dcaccccagt gaaatacaaa tattcagaat tgagccactg 1560 

tgtcgccata ctgattatgt aatgtgtgat taacaagtat aatgtgtcac rttcaacatc 1620 

agtttcatgc caaagttgca ttttattaga ttatttggga gttcactttg ggcccaaagg 1680 

ctcgcgtcta cataataata acttatgatt tttctttttg tctttgttt- attttttgtc 1740 

tcgtgttttt tgctttctag accatgccag agtaatctca gctttcttta gttactggat 1800 

cacacatatc cttcctgaga agagcagtga ccaaaatgga atatctcttt aagaacagct 1860 

cctctttaac aaaaaaacct aaaagacaaa tgtgagatgg gcttagagtt agttctctgg 1920 

gaacttgaaa gacatttatg ccatattatt tattcacgtg cttgttcctg gtgggcaaga 1980 

tgccatctga ggcttcagat gagaaattgg ggtaaaatgg aaatttttca cttatttgca 2040 

attatatata tcttgaatta ctacataaaa cttgattctg tttc-ctact. tattgcaaaa 2100 

attgaaaatg gacattctgt Laagtxaaat gtatagctcg aagccca.at atttttatga 2160 

agttttgaat caccttg-at ctgaaagtct ctgctttaag aatgctttct gggtattaaa 2220 

atgttccagt ttaagtagtt tgaatatagt tgagtttttt ttctcctctc tactttgtga 22B0 

atcatatcag gcacccgttt ttcccgtttc gactttcttt tctgtgatag aagcagtcgt 2340 

cagttcttgg tattactaag tgttaaaagc atcagtcagg ccgggtgcgg -ggctcacgc 2 400 

ctgtaatccc ggcactttgg gagaccgagg caggcggatc acaacgtcag gagatcgaga 2460 

gcaccc-ggc caacacggtg aaaccccatc tctattaaaa acacaaaaaa Ctagccgggc 2520 

gtggtggcgg gcgcctgtag tcccagccac tctggaggct gaggcaagag aatggcatga 2 580 

. acccgggagg cagagcttgc agtgagctga gatcgcacca ctgcactgca gcctgggcga 2640 

cagagcgaga ctccagccca aaataaaaaa aaaaaaaaaa 2680 



<210> 48 
<211> 1730 
<212> DNA 

<213> Hotuo sapiens 



<400> 48 

crrcacgcgtc cgggggcctg cggcggagtc caagaggaac coggccgcct tcgggatggt 50- 

gtggagcgct gctcctgccc cctgttgccc cctcggggtc ccggggctgg tccaggtgct 120 

gggggcccaa gccgcgggcc cctggacggc tzcagcgtgt ctgggggcag ctcaggctca - 180 

gccctgcagg ccctgcaagg agagctctcc gagcttatrc tcagcttcag ctcccixaat 240 



1 
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'gactcactga atgagc'tcca gaccactgtg gagggccagg gcgctgatct ggctgacctg 
ggggcaacca aggaccg-at catttctgag attaacaggc cgcagcagga ggccacagag 
range tacag agagtgaaga gcgcttccga ggcctagagg agggacaagc acagcccggc 
cagtgcccca gcttagaggg grgattgggc cgnct^gagg gtgtctgtga acggttggac 
actgtggctg ggggactgca gggcctgcge' gagggcc ttt. ccagacacg t ggctgggctc 
tgggctgggc tccgggaaac caacaccacc ag'ccagatgc aggcagcc=t gctggagaag 
ccggtcgggg gacaggcggg .cctgggcagg cggctgggtg cccttaacag ctccctgcag 
ctcctggagq accgtctgca ccagctcagc ctgaaggacc tcactgggcc tgcaggagag 
gctgggcccc cagggcctcc tgggccgcag ggacccccag gccctgctgg acctccagga 
tcaccaggca*' aggacgggca agagggcccc atcgggccac caggtcctca aggtgaacag 
ggagtggagg gggcaccagc agcccctgcg ccccaagtgg ,cac tttcagc cgctctgagt 
ttgccccggt ctgaaccagg cacggctccc ttcqacagag tcctgctcaa tgatggaggc 
tattatgatc cagagacagg cgtgttcaca gtcgccactg gctggacgct actcgctgag 
cgcggtgctg actgggcacc ggcacgagaa agtggaggcc gtgctgtggc cgctccaacc 
agggcgtggc cggcgtagac tccgggi_ggc tacga'gcctg agggbczgga gaataagccg 
gtggccgaga gccagcccag cccgggcacc ccgggcgrct tcagcctcat cctgccgccg 
caggccgggg acacggtctg cgtcgacctg gtcatggggc agctggcgca ctcggaggag 
ccgctcacca :cttcagcgg ggccctgctc tatggggacc cagagcttga acacgcgtag 
acr.ggggccc cgcccgacgt gtctacgccg gctgaagaga cagcgggggc ggcgggcccc, 
tgggg^ctcg cctgagacgg ggcacctagc cctgggcgag cgccgcaccc gggcccgcag 
uggcaccgcg cccagagcgg cctctcccca cgcccggggc gcgccggctc agggaggctc 
ggggccgccc atgcagactt ttggcctggc gcgatccccc aagaacccct ccagggccgg f 
cctgcggagg agccgatcct cgcaccctcc gctccctcca ctggcccc.cc aggtcgattc 
cctgggctcc aggctccccc gcgcgggcgc cgcccaccgc' catactaaac gatcgaggaa 
taaagacact tggtttttct aaaaaaaaaa aaaaaaaaaa aaggggccgg 

<210i 49 ■ ■' . ■ '.. 

<211> 1275 

<212> DNA . ' ' 

<213> Horao sapiens * 



<400> 49 ' '. ; ' • . * • 

ggcacgagcc agctcctctt ccaagatgtc ctcctcaaac tttgccagtg aaatggagaa 60 

gtactgrttg gggaataata tgctttcaag gttttgctta tttttgataa tgctcctgca 120 

catccttct- trtctggtta ecttcattca- acgtcacact gttgtctctc tctccaagca 180 

ccaccccttt gttccaacta atggatcaaa gagttatagc agcttttaag gcctgattac 240 : 
tacctgagga ggacctttgc ctaggctatt gccgcaaata agaaagatcc tttttaaaca . 3 00 

cactggcgca attctggaag gattacaaca tctatggcag catcaacaac cttggctaag 1 360 

ttgggggtga agtcaccaag gagtgtgtaa argctacctc ggaagagata ttgaagacat 420 

ttgtccatga cttcaaaaca ttgctaagga tgaggatgtt gcaaaaaaaa aatattcaca 480 

aggctttagt tgaaatggca aacaacttta agctggatgt ggaggaggat gatattgagg 540 

aggtcacaga tgtggtcact gaggaattga ctaatgacga gttactggaa ccagaacaga 600 

aatgcatagc aagaggaaag gaaactacag gagaaaaaaa gaaaagcccc caaggatatt 660 

cacagtgacg ggtttaggca gaagcttctg cagaccccaa caaactcctt aaaacgtttg . 720 

aaaatgctga ctccaacatg aaatgatttt catcagtaga gaggaatgt.t catcgtgcat 780 

tatctattta caagcaaatc tatgatgaat aaaagaaaca aaccaaacaa accaccatga 840 

acacatttct gaaaagagtg acatctcaag aagagcccca ggcaggtcct rcaggaggta 900 

tcccagaaga agaaggcatt gttatcacag gacatgacag ctccctgggt gtcattgccc 960 

ctaaoggcct tccagtagga caaggcgtgg agatggaagg tgccgatttc tatgaccctg 1020 
accccacaca ggtctaggct aatgggtaat ggtgcgtttg tgccctagct tttaaagaaa . 1O80 

aatgtaaaaa gtaaaaaata aaaataacat taatataggt aaaagtf.at agaacaagga 1140 

tataaggaga gaaaaaaatc atcttcctct gtatttgtgt ttcaagctgt gttactacaa 1200 
aagaacaaaa aagttaagag, aaatcgtttt taaagtaaaa cagctacaat aatctaaaaa ' 1260 

aaaaaaaaaa aaaaa ' 1275" 



300 
360 
420 
480' 
540 
600 
- 660 
, 720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13.80 
1440 
1500 
1560 
1620* 
1680 
1730 



<210> 50 
<211> 1762 
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' ' _ 32 ' - V " " V 

<212> DNA ; . ' • 

<213> Homo ' sapiens ; ' ■ ■ , 

<220> " ■ ■• ' 

<22l> SITE • , ; ' . . . . ' 

<222> (447) " , , 

<22 3 > n : equals a,c!g, or c 



<400> 50 ' ■;. ' ■ 

. ggtggcccag agacgctgcc gccgcggttg cagttgtcgc gcacgcctct gcccgccagc 60 
ccgctccacc gccgtagcgc ccgagtgtcg gggggcgcac ccgagccggg ccatgaggcc * 120 

gggaaccgcg ctacaggccg tgctgctggc cgtgctgctg gtggggctgc gggccgcgac 180 

gggtcgcctg ctgagtgggc agccagtctg ccggggaggg acacagaggc cctgttataa 240 

agtcatttac ctccacgata cttcccgaag accgaacttt gaggaagcca aagaagcctg ;300 

caggagggat ggaggccagc tagtcagcat cgagtctgaa gatgaacaga aactgataga '360 

aaakttcacc, gaaaacczc: tgccatctga zggtgacttc tggattgggc tcaggaggcg 420 

tgaggagaaa caaagcaata gcacagnctg ccaggacctt tatgcttgga ctgatggcag 480 

catatcacaa cttaggaact ggtatgtgga tgagccgtcc cgcggcagcg" aggtccgcgt •* 540 

ggtca-gtac catcagccat cgycacccgc cggcaccgga ggcccctaca tgttccagtg - 600 

gaatgatgac cggtgcaaca tgaagaacaa tttcatttgc aaatattctg' atgagaaacc < 660 

agcagttcct tctagagaag ctgaaggtga ggaaacagag ctgacaacac ctgtacttcc 720 

agaagaaaca caggaagaag atgccaaaaa aacatttaaa gaaagtagag aagctgcctt 780 

gaatctggcc tacatcctaa tccccagca- tccccttctc ctcctccttg tggtcaccac 840 

■ agttgtatgt tgggtttgga tctgtagaaa aagaaaacgg gagcagccag accctagcac 900 

aaagaagcaa cacaccatct , ggccctcccc tcaccaggga aacagcccgg acctagaggt 960 

ctacaatgtc ataagaaaac aaagcgaagc tgacttagct gagacccggc cagacctgaa 1020 

gaatatttca ttccgagcgt gr.tcgggaga agccactccc gacgacatgc cttgtgacta 1080 

tgacaacatg gctgtgaacc catcagaaag tgggtt-gtg actctggtga gcgtggagag 1140 

tggotttgcg" accaatgaca tttatgagtt ctccccagac caaatgggga ggagtaagga 1200 

gtctggatgg gcggaaaacg aaatatatgg t'tattaggac atataaaaaa ctgaaactga 1260 

caacaatgga aaagaaatga taagcaaaat cctcttattt tctataagga aaatacacag 1320 

aaggtctatg aacaagctta gatcaggtcc tgtggatgag catgtggtcc ccacgacctc 1380 

ctgttggacc cccacgtttt ggctgtatcc tttatcccag ccagtcatcc agctcgacct * 1440 
tatgagaagg taccttgccc agg'tctggca catagtagag tctcaataiaa tgtcacttgg , 1500 

.ttggtcgtat: ctaactctta agggacagag ctttacctgg cagtgataaa gatggqctgt 1560 ' 

ggagcttgga aaaccacctc rgxtcccctt gctctataca gcagcacata ttatcataca 1620 

gacagaaaar ccagaatctt .ttcaaagccc acatatggra gcacaggytg gcctgtgcat. 1680 
cggcaattct catatctgti tttttcaaag aataaaatca aataaagagc aggaaaaaaa " 1740 

aaaadaaaaa aaaaaacLcg ag-.,v 1762 



<210> 51 

<211> 2059 

<212> DNA 

<213> Homo sapiens 



<400> 51 .' 

cggcacgagg tttggrczoa gagtgagcag aactgrgttg ayuLgygagc acacttggtc 60 

gtgttcaaca cagaagcaga gcagtggaag gaggcangca tccccagctg ataccttcgg 120 

ttattgctgt agttttcatc ttacttctcg gtgtctgttt tattgcaagt tgtttggtga ,180 

ctcatcacaa cttttcacgc tgtaagagag gcacaggagt gcacaagtca gagcaccatg » 240 

caaagcr.caa atgcatcaaa gagaaatcag aactgaaaag ttgctgaogg gagcacctqg 300 

aactgttgtc ctattgactg gagaaccctc cagttccaac tgctattctc ctccttactg 360 

acaacaagac gtgggctgag agttgaaagg aactgttcag ggatgggggc ccacctgatg 420 

accatcagca cggaagctga gcagaacctt attactcagc ttctggatag'acggctctcc 480 

tatttccttg gacttagaga tgagaatgcc aaaggtcagt ggcgttgggt gggaccagac '540 

gccatttaac ccacgccaga gcattctggc , ataagaacga acccgacaac tytcagggag 600 

aaaactgtgt tgttctggtc ttataaccaa gataaatggg cccggaatga tgttccttgr 660 

aactLUgaag caagtaggat ctgcaaaata cctggaacaa' cattgaacta gaaactcaga 720 
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aagtggtcct 
tgcgtcactg 
tttttatttt 
gataatgtgg 
agtatttcaa 
cgtgaataat 
ttcaaaagat 
gcactttaaa 
atgzcagtca 
agttctgaat 
tcttactcca 
agtccttcaa 
acattgttag 
ggtatcacag 
aatttctcaa 
aataazcaca 
tatagtttga 
gcatgaaaca 
tcttaaaatt 
aaaagcagta 
tgatcgtgtg 
ttagctgaaa 
aaaaaaacta 



tgtgacggaa 

gaacdcagaa 

gtttgatrca 

cttttgtatg 

ggcgtccact 

gcaatccctt 

gggatattct 

gatttgaaac 

ttacaattac 

tatgctgttc 

cccaacacgt 

tcc-acagca 

:tcatgacag 

gaagctactg 

tcgtttgaag, 

ttttcaaccc 

actagcaagg 

ctaattgcaa 

ttcaatt-Lgc 

C^tattttgt 

gtggtggtgg 

tggtgctaaa 

gaggggggg t 



agagaaaaga 

aacatgctgg 

ttcgagacaa 

gtgtttgacg 

cttcaatcgg 

tgtcattttt 

aactggtagt 

cacattttta 

at tgcatgag 

tacagatgga- 

t- ttgcaaag 

gaggaaataa 

acccaggtgt 

gggattactc 

taacattgta 

atttatacaa 

aagttattgr 

agggcagtca 

caatctctcc 

aaattagcta 

tggtgacgga 

tiaaatttgac 



aaaaccaatt 
ttcatacagc 
catgtgtgta 
"gaaggaataa 
tgtgcactga 
tcccctttct 
ggtgcatcat 
ttgtttgatg 
gaaaatttct 
aaaaagtcca 
caagaagtct 
aatcccccag 
gctccattag 
gaccrcacca 
tticgtgtttg 
attgttaatg 
tctgacaacc 
catccaactt 
tggaaaccta 
ctgatrtttc 
tgcgtgtagc 
tgtgtttgcc 



agaataaggc 

gtttttagtc 

tgtgcgtgtg 

cctttctccg 

atgcatgtat 

cagactctta 

ttctaacc=a 

tttcaLLttc 

ccagaacaac 

aatgccttza 

ttgtaagaca 

aagccaaagg 

agataacata 

cttagctaac 

cattattaat 

tttctttaga 

agaaa ttatg 

taataaaata 

actittcctac^ 

gggtttgraaa 

tgtgttaaaa 

atartcacta 



agaatgtacg 
ataatggtct 
tgtgtgcgta 
ctttcttagt 
ggaagaatag 
gcr.cttaaaa 
aatattgcaa 
, agacttttta 
agtgtggaat 
aaaatttact 
ccctaaacaa 
gctcaccttc :■ 
cattccctct • 
gactggataa 
tcgaatagaa 
gctgtataac 
cttttctggt 

tggtggtctt 

tactactttt 
cactggf.grc ' 
gctgatactt 
aaaaaaaaaa 



780 
840 
900 
96C 
1020 
1080 
1140 
1200 
1260 
.1320 
1380 
1440 
1500 
1560 
1620 
1680 ' 
1740 
1800 
I860 
1920 ■ 
1980 
2040 
2059 ' 



<210> 52 

<211> 3282 

<212> DMA 

<213> Homo sapiens 



<400> 52 
cccacgcgtc 
aaaaaagacc 
ccgggagctg 
gagttgtgcc 
'ttggcaggtt 
gtggtcaa~c 
agggtggttt 
ctcacaaatg, 
tatctccaaa 
gatataaatg 
aatacagctg- 
aacggtatcc 
ggtgatgaag 
cagggagagc 
acctctactg 
gccctgtatg 
tgggcagaag 
tcagaaaata 
gaattgcctg 
ggaggccttt 
cctgaartga 
ctggctgttt 
attcaagaga 
acagaaggcg 
gacttgqgga 
aataacaacg 
agccccgccc 
caggtL-acct 



cgacttaaaa 

cctgggctac 

tggctgtaac 

ccccaagaca 

ctgggtttgg 

tggcaaagga, 

ccgacgataa 

agaaactgga 

ccttaatgga 

atcacgcgcc 

aagggacage 

aaaactacac 

gcatgatata 

ccagcctaac 

tacgcatcgt 

agacccaggc 

atgtagactc 

ttcgaacaac 

attatgagtt 

ctgcaagatg 

tcgtatcacc ' 

ttaagactaa 

atctgccatt 

cgctggacag 

cacccaggct- 

cccccgccr t 

tgcacatcgg 

actcgctgcc 



gagaagcttt 
acggcgtagg 
caactaggaa 
aaggcaagcc 
acgttattcg 
tctgggacta 
caaacaatac 
ccgagagaag 
tgatcccttt 
agtatttcag 
atttagacta 
gatcagcccc 
cccagagcca 
cctcacagcg 
tgtcttggac 
tccagaaaac 
tggagtcaac 
cctccaaatc 
ageaaactct 
-agggtttta 
attttccaac 
tgacagagac 
ccractaaaa 
agagatcaga 
gaaaaccgag 
cacccaaacc 
cagcgccagc 
gccgccccaa 



agctgccaaa 

tgcagggttt 

ataacgtatg 

ctgtttcttt 

gtgactgagg 

gcagaggggg 

ctgctcctgg 

ccgcgrggcc 

cagatttacc 

gacaaagaaa 

gaaagagcac 

aactcttttt 

gtgttggaca 

ctggatggtg 

gtcaat.gara 

agccccattg 

gcgyaciycat. 

aatccttrtf 

tacaaaazaa 

gcggaagtat 

tctg*:rgctg 

cccggagaaa 

ccttctgtgg 

gccgagtaca 

cacaacataa 

tcctacaccc 

gccacagaca 

gacccgcacc 



gattggg/aaa 

cctacLgcng 

cagcagctat 

ttcttttttg 

aaacagagaa 

agctggctgc 

aticacatac 

ctaaagagcc 

gggctgagct 

cagtcttaaa 

aggatccaga 

cccatattaa 

aagcactgga 

ggtccccatc 

atgccccaca 

ggttccttat 

cctattcatt 

ccggggaaac 

atatacaggc 

Cggacaccaa 

agaattctcc 

atggaaagac 

agaatttcta 

acatcactat 

ccgtgctggt 

Cgttcgcccg 

gagactcggg 

tgcccctcgc 



gggaaaggac 

ttcctttatg 

ggctgtcaga' 

gggagtgt cc 

aggatccttt 

aa g99gaac:c 

cgggaatrtg 

ctgtatgctg 

gagagccagg 

aatatcagaa 

tggaggactz 

tatcagcggc ■ 

rcgggaggag 

caggtcrggg 

gct^gcccag 

tgttaaggta 

zcctgatgcc 

ctttctcaga 

aarggacggt 

tgacaatccc 

cgagacgccg 

ggtttgctar 

catcctaatt 

caccgtcact 

ctccgacgtc - 

cgagaacaac 

caccaacgcc 

ctccctggtc 



60 
. 120 
. 180 
240 
300 
360 
.420 . 
'480 
/ 540 . 
600 
660 
720 
780 
840 
900 
4 9S0 
1020 
1080 
1140' 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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-ccatcaacg 
caggcttccg 
gcgctgggtg 
gctgcagaac 
cctggtgacc 
ccagctgctc 
gcgcaccgcc 
caaggacaat 
cggcttctcc, 
cttgctcacc 
* ggtgctcctg 
ctgctcggtg 
cctgtcccag 
caagttcttg 
agaaaattcc 
ttcatatact 
atttaagtta 
aatgtttcat 
ctaaatgata 
tatctcatta 
atcacttctg 
cattctaaca 
tttaaaaaga 
gagcaatact 
atgctacttt 
ctatgcctta 
tttgaaactt 



cggacaacgg 
agtcccgcgt 
cgcgtgctgg 
ggctccgcgc 
aaggtggtgg 
aaggccacgg 
aggctgctga 
ggcgagcctc 
cagccctacc 
gtctacctgg 
ttcgtggcgg 
cccgagggtc 
agctaccagt 
aaaccagtta 
accttccgaa 
tttggtgtgt 
ttatgcaact 
catttttttg 
gtgttaaggt 
cagaaatccg 
tctatagtgt 
tcctaratat 
aatatctctc 
ga a caeca at 
tacttggeca 
taattgaaat 
gtgaaaaaaa 



ccacctgttc 

gggc'gccaca 

tgctggacgc 

cctgcaccga 

cggtggacgg 

agcccgggct 

gegagegega 

ctcgctcggc 

tgcctctccc 

tggtggcgtt 

tgcggccgtg 

cctttccagg 

atgaggtgtg 

etteggata- 

atagcttegg 

tacatageca 

tcaagcatta 

cartaataac 

tttaautctt 

aggttttgat 

aettgetcta 

t.cgtgtttga 

tactactatg 

aataccccta , 

atattttctt' 

aaaattataa 

aaaaaaaaaa 



gccctcaggt 
•gaccgcggct 
eaacgacaac 
gctggtgccc 
cgactcgggc 
gttcggtgtg 
cgcagccaag 
caccgccacg 
ggaggeggee 
ggcctcggtg 
caggaggagc 
gcatctggtg 
tetgaeggga 
ccaggcacag 
atttaatatt 
tgtttctatt 
ttttcaagta 
aactgggttt 
cccaactgcc 
tcatttcaga 
tttaagaagg ' 
aaaccaigtc 
ctcatgacaa 
gttcacatac 
atgttaactt 
tetgectgaa 
aaaaaaaaaa 



cgctggacta 
cccccgcgct 
tcgcccttcg 
cgggcggccg 
cagaacgcct 
tggqegcaca 
cacaggctgg 
ctgcacgtgc 
ccggcccagg 
tcttcgctct 
agggeggect 
gacgtgaggg 
ggccccggga 
ggccctggga 
cagtaaagtc 
agtttacttc 
gtatacccct 
aatttaatga 
caaggaatta 
gettgeatcz 
catatctaca 
atttacttct 
aatgaaacaa 
ttattatttt 
ttgctgatgt 
aatgaataaa 



cgaggccccg 

gaacagegag 

tgctgtaccc 

ageegggcta 

ggctgtcgta 

atggggaggt 

tggtgcttgt 

tcctggtgga 

cccaggccga 

tcctcctctc 

cggtgggtcg ? ; 

gegctgagae 

ccagcgagtc 

ggaagggtga 

tgtttttagt 

taaatctcaa 

gtggttttac" 

gcatcttttt 

attactatta 

catgattcta 

tctccaaact 

acatcatgta 

agcatattgt 

atctttaagc 

ataaaacaga 

aataaaacat 



1740 
1S00 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2560 
2640 
2700 
2760 
2320 
28fi0 
2940 
3000 
3060 
-3120 
3180 
3240 , 
3282 



<210> 53 
<211> 1860 
<212>' DNA 

<213> Homo sapiens 



<400> 53 
'.aatteggcac 
cccgctcctg 
gaggagcact 
ggaggccggt 
gatggagccg 
teeggaaagt 
cgccctgccc 
cgggatcccc 
tgcccaggtg 
ggacggcacc 
gtttgtgaac 
tatagecegg 
agatccagtt 
aattcctctt 
tctcccncaa 
ggctttcctg . 
tggcactctg 
ccccctccca 
> cacattggtg 
aacaaaccet 
gacctgctca 
ctcttcgact 
gcgctcacca 
cczctccagc 



gagcagcagg 
gecattgetg 
gtggactcgg 
gtgggcttcg 
ctgaaactgc 
ggcagcacca 
cccttcacct 
tacaaccacc 
ctcactgtca 
accaccggca 
cacctgtccc 
ctgctgtcca 
ccaccaggag 
cttcgtgctg 
cgatgcccgg 
gcggcgaggt 
accccc.agg* 
cccgagtgac 
cccccatctt 
actcaatccg 
tt'gtggtgtt 
gcctgctcac 
ccaggtggtg 
ccacagaaag . 



gagaagagga 
acctgctctt 
cagaggacgt 
cLtactcccc 
tgctgacatg 
acccatgggt 
ccctcctcaa 
tgctcttctc 
ctLttggacca 
ccgccatgga 
gcatccatcg 
acccccrgct 
ctgctagttc 
aagagcagcg 
gecgatcagt 
ggagggctgg 
agggaggect 
cccccccacg 
getgettctg 
cgtgcccatg 
ccacaagatc 
catcgcggtc 
tcagccacag 
aacgcaagcc 



tgatgagcat 
ctgcccggac 
ccaccccctg 
ccagcctaac 
ctcctccgag 
tcagttcttt 
caccgtgtgt 
tgactacegg 
cgacagcgcc 
tgatgecgat 
tgaggaggac 
ccagacctac 
tcttctggaa 
aegtcctaga 
gtaagaccag 
ctatctgccc 
ccaaaatggt- 
ccacctgccg 
ageggggage 
gacatcccag 
atcaccagcg 
aacggtaggg 
gccaacc'tgc 
tgtgtaccct;. 



gccaggcccc 
ctcacggttc 
gacagctgtg 
tacatccacg 
gccatgtacc 
tgttccacgg 
gcctatgacc 
gaacccctgg 
agcagcgcca 
cctccaggcc 
ttccagtt-.ra 
ctgccfcaact 
gctcugcyac 
cacccctgtc 
ggcgggggtg 
tgactcccag 
cactgcctgg 
cagctegggt 
ggaacttegg 
ccttcacagg 
ggcaccagcg 
gcccggogcc 
cacccgcccc 
tcatagtagc 



tggccgagtc 
•agagccaccg 
aatacacccg 
atatgaaccg 
zgcccccagc 
agaacagaca - 
ccg-gggcta 
tggaggaggc 
gccccactgt 
ctgagaacct 
ccctcaaggg 
ccaccaagaa 
cccaacaaga. 
cc'catcctct 
ggatgctggg 
gagctcagcc 
gcccctccuu 
gggcccgatg 
ggcgcggccg 
gacccacgcc 
gtcgcagccc 
cgcactcccc 
aggtttcaag 
cacatiaaaa 



60 

• 120 
180 
.240 
'300 
360 
420 
480 
540 
600 
660- 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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aaaaagggtg acaacaattg aataatacgt tttatctatc atcatctcaa catgtagcca 1500 

■ acatgaaaac tactaatgag atatctaata ttcccttctc acactcagtc ttgtcgccca 1560- 

ggctggagtg cagtggcacc a;ctcagctc actgcaacct ccgccccttg ggtccaagca .1620 

attctcatgc ctcagcctcr tgagtaggtg ggattacagg cgCgtgccac cacacctggc 1680 

taatttttgt atctttagta gagacggggL tttgccatgt tgtccaggct ggtctcgaac 1740. 

tcctgacccc agatgatatg cccacctcag cctcccaaag tgczgggatt acaggcatga 1800 

gccactgcat ctggcctttt taaaaaaaaa . aaaaaaaaaa actcgagggg gggcccggta* 18S0 

<210> 54 . ' • • • ' 

<211> 770 . ; 

<212> DNA , 

<213> Homo iSapiens ' . ■ 

<40Q> 54 ' • ' ! V - ' ' , 

aactcggcac gagccgaggg gcgaatccat acatccgtcg tgtggactcg ccctgacatc 60 

ccctcaacgc ttagactcgg cattcttttc'attgcctgat gccgtttcat gccttctttg : 120 ' 

cgtttcctcg ctctgg'ccct cctgctggcc atccttcccg cacctcccaa cgcccacgcc 180 

gcgccgggga ttggcggcct gatcggcggc* ggctccca'gg cctcagccaa ygaagagccc ' .240 ■ 

cagagcaacg cgcaacccag,' cgccgatgag cgcaaacaac gaccgcccag ccaggccgag'' 300 

gaaacccggc agcgcctgac cgatctcaag gccgaacccg ccggcgcacc caaggagatc' 360 

agcgaggccc- agcgtacgct ccccaaactg gtgagcgagg acaacagcga, tctgcccgag 420 

cgcctctcca agctttcggt gccggtactg gagcaacgcc tggcggcccg cgtggacgag ' >" 480. 

ctcgccctct ggcaacaagc gctcagcgcg gccaacagca tgctcatcag cgcgcagacc 540 

cggcccgagc gcgcc'caggc cgatatcagc aagaaccagr tgcgcatcga cgagatcaac'' - 600 1 

ggcctgctga aaagcggtcg ggagaacaac aagccgctga cggacgaacg tcgcgcgctg 660 

ctcgagagta cttctagagc agccgcgggc ccatcgattt tccacccggg tg-gggtacca' 720 

ggtaagtgta cccaattcgc cctatagtga gtcgtattac aattcactgg t 770 

<210> 55 ■ ' 

<211> 1093 ' 

<212> DNA ■ ' ' ' . ' „ ..' 

<213> Homo sapiens / • ' > '. ' 

<400> 55 " - • 

cagatccggc acgaggtttt ttitaagatt tattttttaa aaattatttt tgtggacttg \ ,60 

ggtatcaatg atggcaccta ccttcgggaa cctgtagctg tgctttgaga attgccatcg 120 

gtcatgtgtt gcaccgtcct ctgtatgttt acgtcctntg gactggc^tc: Lcccaggact 180 

, ctcttctgtt tttgtttttt tgattcgggc tttatttttt tctgtgtact gtactatatt ' , 240 < 

gtaaaaggga ttttagcaga gactttagtc tttggggcaa gaggagaaca ggaatgccgg 300 

gctgtttact ttaggcggag aacccatctt "cagacctttg gactattttc tttcaactgc 360 

agtgtataga aaaaccaaac tacgacctca gagcagagta ttaatgaaaa gcacaaaaaa .420 

aggaactaag tccagcgagg ggtgggggga ggggggagat tLLLcLtttg aaaaataacg 480" 

actcttagga catttgtttt' tcagttcaag tgctcttcag cactgtcttg tctcccaata 540 

taccaaccca ctggcacatt tttctcttgt tttctctctc cgattttgct ctqtctcctc 600 

agttaagtgt ttccttcctt tgtgcccccc gctggcgacc ctctgcttcc ctc-ctcttt 660 

ccctttggrn gctgcaatac acagtgttat tttggggaaa taaatctagc aaogcctcgc 720 , 

cctccatgcc gagcgtcctc ttggctctga gagggaaagg Lccgtccttg ggacgc.ctc ^80 

tggtcttttc. tccccccaag tcttectctt tccca'tcata cccztccctg cccaccttgt" 840 " 

tctctgrtcc ccttttatta ggaattccca agtqaacttt:. attaatgtgg gagtggaaca , .900 

gatgctaaaa gctatccagg ar.htr.gr.ttc tgtttgtttt aaa'ttttgtg gttccttccc ' * 960 

tttcctcccc ctcccatgcg. taagacgttc . tgtgtaacct ccat-aiaatt tggcacaaaa 1020 

ccactcgcca gagctgtggt gtcagaaa<aa taaaatatat tgcttcttam aaaaaaaaaa ' 1080 

aaaaaaaact cga , , . - \ 1093 



<210> 56 
<211> 632 
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<212> DNA ■ . 

<213> Homo sapiens 

<220> • ' t t • 

<221> SITE • 
<222> (29) 

<223> n equals a,t,g, cr c 

<220> ' • . • • / t ' ■ 

<221> SITE ' ' 

<222> (46) 

<223> n equals a,t,g, or c 

<220> . ' 

<221> SITE. ' "' • 

<222> {94) , ' ■ •• 

<223> n equals a ( t,g, or c ' t . ' < 

<220> • . 

<22l> SITE ; ■ 

<222> (162) - ' ' 

<223> n equals a, t,g, or c 

<400> 56 ' 

cgcaattaat gtgagtcage tcactcatna'.ggcaccccag gcttcntact ttatgctttc 60 

cggctcgtat gttgtgtgga a ttgtgacgg^ atancaattt cacacaggaa acagctatgc 120 

catgattacg ccaagctcga aattaaccct cactaaaggg ancaaaagcf ggagctccac 1B0 

cgcgg-ggcg gccgctctag aactagtgga tcccccgggc tgcaggaatt cggcacgaga 240 

ctatgtatat atgtttaata tctgtctttt gaaatgcaga aatagtttaa atgtttcttt 300 

gtctattttt cttttttttt aatgctaccc agggaaatat » ttcca tatca ' Cttttaagtg ' 360 

gcctgcctca atgtatattt atrtcttttg aaacaaaaag gttctggaaa ctgttrttct 420 

gtagctttaa atgaataggt ga.gcaaaatc , tatatgggac gtaatttttt. cg-tcagtct 480.*- 
cttaaaaaat actttgtttt gq*tacatctg gttgtgcttg tggggaaaat aaaaacgcag , 540 

agatccttat atatttacgt caaagcaata ttrtattatc: tacataaaac agaaatgcac 600 

aataaaaaaa aaaaaaaaaa gctcgagggg gg 532 



<210> 57 • 

<211> 2587 ,. 
<212> DilA 

<213> Homo sapiens 

<220> " 
<221> SITE * 
<222> (1614) 

<223> n equals a\t,g t or c 
<400> 57 

gtacaccafcg ggcctccacc tcccccccta ccgtgtgggg ctgctcccgg acggcctcct . 60 
gctcctcttg ctgctgctaa tgctgctcgc ggacccagcg ctcccggccg gacgtcaccc 120 
cccagtggtg ccggtccctg gcgatttggg taaccaactc gaagccaagc tggacaagcc 180 
gacagtggtg cactacctct gctccaagaa gaccgaaagc tacttcacaa trtggctgaa 240 
cctggaactg ctgctgccrg tgcatcattg actgctggat cgacaatatc aggctggttt : ' 300 
acaacaaaac atccagggcc acccagtttc czgacggtgt ggatgtacgt guccccggct 360 ' 

ttgggaagac cttctcactg gagttcccgg accccagcaa aagcagcgtg ggttcctatt/ 420 
tccacaccan ggcggagagc cttgtgggct ggggccacac acggggtgag: gatgtccgag 480 
gggctcccta tgactggcgc cgagccccaa atgaaaacgg gccctacttc ctggccctcc 540 
gcgagatgat cgaggagatg taccagctgt atgggggccc cgtggtgctg gttgcccaca . 600 
gtatjggcaa catgtacacg ctctactttc tgcagcggco gccgcaggcc tggaargaca 660 
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agtatatccg ggccctcgtg tcaccgggtg cgccctgggg gggtgtggcc aagaccctgc 720 

gcgtcctggc tr.caggagac aacaaccgga tcccagtcat cgggcccctg aagatccggg 780 

agcagcagcg gccagctgtc tccaccagct ggctgctgcc ctacaactac acatggtcac 840 

- ntgagaaggt gttcgcgcag acacccacaa tcaac -a'cac' actgcgggac . laccgcaagt 900 

tcttccagga caccggctitt .gaagatggct ggcccacgcg gcaggacaca gaagggctgg 960 

tggaagccac ga'tgccacct ggcgtgcagc tgcactgcct ctatggtact ggcgtcccca .. 1020 

caccagactc cttctaciat gagagcttcc ctgaccgtga ccctaaaatc tgctttggtg , 1080 

acggcgatgg r.actgtgaac. ttgaagagtg. ccctgcageg ccaggcctgg cagagccgcc 1140 

aggagcacca agcgttgctg caggagctgc caggcagcga gcacatcgag atgccggcca- 1200 

acgccaccac cccggcctat ctgaaacgtg tgctccttgg gccctgactc ctgtgccaca -1260 

: ggactcctgtggctcggccg cggacctgcc gttggcctct ggggctgtca tggcccacgc . 1320 

gttttgcaaa gtttgtgact caccattcaa ggccccgagt cttggactgt gaagcatctg 1380 

ccatggggaa gtgctgectg ttatcctttc tctgtggcag tgaagaagga egaaatgaga 1440 

gtctagactc aagggacact ggatggcaag aatgctgctg atggtggaac tgctgtracc . <■ 15O0 

ttaggoccgg cu-cacaggg tggaccggct gggccctggt cccagtccct gcctggggcc - 1560 

atgtgtcccc cctattcctg tgggcttttc atacttgcct actgggccct ggcncsgcag 1620 

ccttcctatg agggatgtta ctgggctgtg gtcctgtacc cagaggcccc agggatcggc, 1680 

tcctggcccc tcgggtgacc cttcccacas accagccaca gataggcctg ccactggtca 1740 

tgggtagcca gagctgctgg • cttccctgtg gclLagctgg tggccagcct ga=tggcttc -1800 

, ctgggcgagc ccagtagctc ctgcaggcag gggcagtttg ttgcgttctt cgtggttccc I860 

aggccctggg acatctcacc ccacccctac ctcccttacc accaggagca ttcaagctct 1920 , 

ggattgggca gcagatgtgc .'ccccagtccc gcagctgtgt tccaggggcc ctgatrtcct ■ 1980 

cggatgcgct attggcccca ggactgaagc tgcctccctt caccctggga ctgtggctcc ■■ 2040 

aaggatgaga gcaggggttg gagccatggc cttictgggaa cctatggaga aagggaatcc 2100 

aaggaagcag ccaaggctgc tcgcagcttc cctgagctgc acctcttgct aaccccacca 2160 

ccacactgcc accctgccct agggtctcac tagtaccaag tgggtcagca cagggctgag 2 220 

gatggggctc ctatccaccc Cggccagcac cragctzagr gctgggacta gcccagaaac 2280 

*ttgaatggga ccctgagaga gccaggggtc ccctgaggcc cccctagggg crttctgtct 2340 

gccccagggt gctccatgga tctccctgtg gcagcaggca tggagagtca .gggctgcctt -2400 

catggcagta ggctccaagt gggtgactgg ccacaggccg agaaaagggt acagcctcta . 2460 • 

ggtggggrxc ccaaagacgc cttcasgctg gactgagctg ctctcccaca' gggtttctgt 2520 

■ gcagctggat- tctctctgct gcatacatgc ctggcatctg tctccccttg' ttcctgagtg 2580 

r gccccacatg gggctctgag, caggctgtat ctggattctg gcaataaaag tactcrggat 2640 

gctg Laaaaa aaaaaaaaaa aaaaaaaaaa .aaaaaaaaag ggcggcc 2687 



<210> 5B . .: 1 . 

<211> '619 *' " 

, <212> DNA . 1 ■ 

<213> Homo sapier.s ' ■ , ' ■ ' 

<22o> ; - ' . • ' ' , 

<221> SITE " r . '. ' \ . . 

*<222> (526)'.'' . . 
<223> n equals a,t,g,' or c 

<220> ■ ' ■ ' . '. jt ' 

<221> SITE ' : • • • 

<222> (619) 

<223> n equals a,t,g, or c 
<400> 58 

tcatcagctt tcacctggai catcgtgaca gcctcccact gcttctctac ca^gtggcca 60 
gagctatctt cctaaaacgc attgcaragt tgatcaagtc actctctggc ctaaaacctt 120 
cctcggctcc ccgccgccct cag'gataaag tccggacccc tcagcatggc ttgtgagact 180 
catggtctcc .ttgtccctgc ccacctctct ggtct.racca, cttgccttct tgcattctgg t 240" 
gtcccagcct cctgtatcca gagatgcagt ggctctccat tg'ccactctg attcctcctt 300 
tcttttggcc acagagaaag ggtaccctct ctgtcaaatc ccaactcaga cttcacttcc , 360' 
tccaaggagc tttggctaca ctctctcctc ccgaccccca ccctggcata ctacacagar 420 
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ca'ctctgggc tcacttgcct gcctaatggt catctcceca gtagarfcgta agctccctga 480 

gggcaaggat tgtgrtggaa tttttgfcatt aacagtgcct ggcttngtgc ctggcaccta . . 540 

gaaagcactc aataaatgtt tgtttaatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa. 600 

aaaaaaaaaa aaaaaaaan , ■ fiiq 



<2i0> 59 
<211> 1378 
<212> DMA 

<213> Homo sapiens 



<400> 59 t •'• ' 

tcgacttctg ggatgtaga^ cctcctttat tgaattcttc attttcactc gcagnatgat 60 " 

■ ttattttctt'cacactgtat "ctcacactgs atatgtttct cttataacga gggtcacttt 120 

cctcctgccc aggattggaa tcaattacag ctargcaatg ttttttgttt agcatcttct 180. : 

taattacagg cttggcagaa gagttttgtg aacaactctc catttcactg gctgaagaag 240 

aaatacaact' gtcctctaca gttgaacat- tctgtatgac agcatttcca tggtaatgag 300 

gattaggttc atactttggt ttttccaaac cggggaaaca aacaacagcc aaaaaccagt 360 

gcgcagcLLc attaaggggt acaaaaataa aatccttctc aaaaatatct acgtgccggg 420 

cccatgtttt tactctccca tgccgctttt gctgtattga cagattagtr. gtctcatgat '480 

Ctctcctc-c tctctgatta aggcgtttat agaaaaaaga actgaatata t tgaattcggt 540 

cagcgtcttc cttcttcagt ttttcaagca ccaagtattt caaataaaag cccataataa 600 

catcatttaa aaattcccct tcattuagac agtgcaggtc ctcactggta acagagatgc 660 

ctcccttagc. tggaggtggt ggatatacta . tcaac'ccttc tactgggcca atgaagatgg 720 

tgtggttttc tccagtttct tcttcttcat caaaaaactg aaattcttgt ttgcttctaa ' 780 

gttgtatttt agattzaaat gatacagttt taattttgtt tcccttttgc ccacaacttc 840 
ctttgatgct ctcttcatag gttcttgtac aggeaacaag tctgccatta gcttcttcaa '. 900 

agggaatttt cgcaaaaaan ttggagatgt tattct-tac. accaatttca ttaatgatac 960 
tttcaaatac catacccgcc ggaggatcaa ggccattttg aaaaattaaa attatatatt . 1020 

gttcttctaa atttg-taat ttatttactc ctttacaatc attccaaact ttatcctcct 1080 

tattcatttg cagttggatg ctcagr:ttct gataaactgc tggaattgct tgaagaaaca il40 

ctacaggtaa ttttcggaca ttacaccatt cacatttagt tagatcagag gtatttaata 1200 - 
taatctccac- aggatcatgg tctggttcgc ctagctgtat cttgacaaaa cctaaacaaa . 1260 

aaatcacagg ctctattaac agccggaaga gtgttcctac tcgtcgacgc ggccgcgaat 1320 

tcccgggtcg acgagctcac tagtcggcgg ccgctctaga ggatccctcg aggggccc 1378 



<210> 60 . 

<211> 1126 • ■ 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE ^ 

<222> (21) 

<223> n equals a.t.g, or a 
<220> 

<221> SITE 

<222> (35) 

<223> n eguals a,t,g, or c ' , . 

<220> ' - • ■ 

<221> SITE 

<222> (49) • " 

<223> n egiials a.t.g, or c 

<220> 

<221> SITE 
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<222> (99) - ' • • ' , 

<223> n equals a.t.g, or c 

<220> ' ' 

<221> SITE * 

<222> (1012) . ■ . • ' • 

<223> n equals a, eg, or c 

<400> 60 - 1 , '■ ' 

tccggcttcg taatgtggtg ngaaattgta aeggnataac aatt'tcacne aggaaccagc ... 60 

tatgeccatg attacgecaa gttcgaaatt aaccctcant aaaggaccaa 'aagegaagtc 120 

ccaccgcggt gccggcccgr .tctagaaact agtggatccc ccgggctgca ggaattegge 180 
acgagccggc ccgtaccgcc aggegatege gctga£ggcg gcgctggcag cagcggccaa .. '-240 

gaaggtgtgg agcgcgcggc ggctgctggt gengctgtze acgccgctcg ccctgctgcc » 300 

ggtggtcttc gccctcccgc ccaaggaagg ccgctgcttg rttgtcatcc tgctcatggc 360, 

ggtgtactgg tgeaeggagg ccctgccgct ctcagtgacg gcgctgctgc cca-cgtcct 420* 

gttccccttc atgggcatct tgccctccaa caaggtccgc ccccagtact zee tcgacac 4B0 

caacttcccc ttcctcagtg ggecgatcat ggccagcgcc attgaggagt ggaacctgea - 540 

ccggcgaatc gccctcaaga tectgatget tgttggagtc cagccggcca ggctcatcct V *600 

ggggatgatg gtgaccaccc cgttcttgtc catgtggctq agcaacaccg cctccactgc -660 

catgatgett cccattgcca angccatcct gaaaagtctc tttggccaga aggaggttcg 720 

aaaggacccc agecaggaga gcgaagagaa cacaggaata gaacccaata ctttcctccc ' 780. 

tgaggaaagg ctgaaacttc aagctcccct tgtgaCaaga ettggecaga taactgagtc . 840 

tggtcaatgg aatatgagtg gaaatgatgc gtgeaactte egggtcctgt ccttcctgcc 900 

gggtggaatg tgaatangat ggcacctggg acccaaagac aggagecaca tcttgagaga 960 

tagatggcag atctgcccct gtggctttgg atcatttacc .tcagtgaaca cnacaagcat ■ 1020 
Latccatgaa accattaggtt ttgtgtgcta gttctagttt ttaaaacatg aattaaatta . 1080 

aataegtate tgttaaaact taaaaaaaaa aaaaaaaacta ctcgag, r 1126 

<210> 61 , • h ■ ' - ' ' 

<211> 2078 
<212> DMA 

<213> Homo sapiens 

■ <220> , * • . 

<221> SITE ' * ■ 

<222> (337) * 

<223> n equals a, t,g, or c 

- <220> ' ; ' . ' " " • * 

<221> SITE" ' t ' ' ■ ' 

<222> (492) 

<223> n equals a, t,g, or c * . . 

• <400> 61 _ 

gattattttc aatagattgt caacaaatta tcaaattatg gaaatatcct- t tag at tec t 60 
gtgataatgc tttgtgcatg ttttctatta Ltcagaaaat gaaaacagca gtcatttgac ' 120 

cagccatgtt atgacaccag ataatattct gcaaggatct tccaaacagg gagaaactgc 180 
aatgggaaca atctttggat atctacattg tgtcaaatgt tatgtgcttt atttcatttt . 240 

catacttata acagctgtat atcacagttt ttattatcca cattatagag gcaaggcact 300 
gatatcagga acttgagtaa ttggcttaca cagenantgt agttgaaata gaagccagga ' 360 
ctgtctgatt cccaagacag tgttcttgta. ctgcaggtaa gcctggtgtt gctgggJcttc ' ■ t 420 

cctagcactg agtattgcag aaataeggag atttatttac tcaggtgaat gatgaagcag 480 

aatctttatt tugcttttta aattatkgat gggkeckgag ttaagctagt tcaccaatga 540 

gtctcaaagg gtgaagaact ggcctgtgta gttttaaaac aaagcagcag tttictccat 600 

ttgtagatat tatattcaat ttaccttgea tttccaatag cactaaatat tagaagegta 660 

taagcctttt etttttatge ccaggggata -ggectaaca ctgagaaaca cccgagaaac 720 

accgtcacai: Cccaaggaac aaccaataat aagaaaaatt attrcatata aaaggatgea' 780 
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gcttgcatat 

ggggtgacct 

ggagcaatgt 
' tctttcttta 
atgcacaca- 
cttcaatctt. 
tgcagaactg 
aggaagttag 
ggttgcaacc 
agtgcttaat 
gcacatcaga 
gattttgtgc 
gtcctttgct 
atattccaca 
gtccgagata 
gaagczagca 
ggccaatatc 
tccagcagca 
tggtgaaacu 
tgtagrccca 
cttacagtga 
tctcaaaaaa 



aaaatgctga 

tcatgctctg 

gttccataac 

aaggcgtgcu 

acatatacmt 

tcaggaaagt 

ctgaaattgg 

tccagctgtg 

ttaggtcaca 

ttaaggatta 

taatgagtta 

atagaaagat 

aaaagccatt 

tttaacaggt . 

ttctagttta' 

ttatcctg'at 

gctgatgaac 

tgaggcgggt 

ccatctcxac 

gctactgggg 

gccgagattg 

aaaaaaaaaa 



attcaactga 
gactcttaaa 
tagggttgcc 
tttcaaaaca 
catgcatgka 
ttgtaaaawa 
ctcagatgaa 
ttttgcgagg 
tggcaactaa' 
ttcaatgcag 
actgcatgta 
gtcttagtca 
atataaactg 
gataattatt 
aaccaagtaa 
actagaactg 
atcaatgtga 
ggatcacaag 
taaaaataca 
aggctgaggc 
caccactgca 
aaaaaaaaaa 



agctcattat 

ggatccagtg 

t taagtata-c 

tgatgttatt 

tcatatatgt 

t'agccrraag 

aaggaaagag 

catgaaccag 

attccagcta 

tgaataaatt 

acttgaaaac 

atgtaaaaat 

aagtcactta 

gaaaatatta 

ctaatactgt 

ggaagagaca- 

aaatcttcaa 

gtcaggagat' 

aaaaattagc 

aggagaatgg , 

ctccagcctg 

aactcgta 



gctgccttta 
tcggacacag 
ccttcttccg 
.tcatatacmt 
gcgtgtatgt 
tagaat tatg 
caaaaatcta 
gctgaatcag 
tagaatattg 
tatatcattc 
cttaaatatt 
gccagtgatt 
cctctgtttc 
'gtt^gcctca 
aaaagaagta 
cacacaaaaa 
taaaatactg 
cgagaccatc 
cgggcatggt 
cgtgaacctg 
ggtgacagag 



tgtgagctca 
' ctctgccatc 
ktatctatag 
atatatatat 
atgtatctat 
gggaaacagt 
ggttattcac 
agcaagaaca 
aagacatgta 
ccaaaagtat 
taactgaaag 
tgtgtagcat 
caatgtgata 
ctaatattgt 
ttccttttat 
gaaaacttca 
gcaaaccgaa 
ctggctaaca 
gatgggcacc' 
ggaggcggaa 
caagactctg 



840 
900 
960 
1020 
1030 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
*1920 , 
1980 ■ 
2040 
2078 



<210> 62 

<211> 762 . 
.<212> DNA 

<213> Homo sapiens ; 

<220> ' 

<221> SITE " ' : • ' 

<222> (10) . 

<223> n equals a^t,g, or c 

<220> " " 

<221> SITE ■ 1 " ^ " . 

<222> (12) . " • 

<223> n equals a,c,g, or c ... 
<220> • 1 

<221> SITE ' . . . • 

<222> (42) • ' ■ 

<223> n equals a,t,g, or c ' *. t 

<220> . . ■ ■ . 

<221> SITE : 
<222> (219) 

<223> n equals a.t.g, or c ' ■ , 

<220> . - 

<221> SITE •. , , 

<222> (747) ' • • 

<223> n equals a, t, g, or c 

<400> 62 •- • t . 

cgagtcagtn ancgagaaag cggaagagcg cccaatcgca anccgcctct ccccgcgcgt 
tgcccgattc attaatg'cag ctggcacgao aggtttcccg 'actgaaaagc ggccagtgag 
cgcaacgcaa ttaacgtgag ttagctcact cattaggcac cccaggcttt acactttatg ' 
cttccggctc gtatgttgtg. tggaattgtg agcggatanc aatttcacac agga'aacagc 



60 
120 
1B0 
240 
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tatgaccatg attacgccaa gctcgaaatt aacccrcact aaagggamca aaagctggag * 300 

ctccaccgcg gcggcggccg ctctagaact agtggatccc ccgggctgca ggaattcggc ' 360 

acgagcacaa atgcctatac aaaacttttt tccatgtcct caatactttg tatcagatta - 420 

tttatatttc tcatggttcg gtttcctttt ctcgtcttca tctttctgaa ggtaaaaggg 480 

gattctctcc ccccttttct gatctgtaac ttgrtttgta ,tttggatgat tacaggcgtg , 540 

agccaccgcc tccagccaca aatacttttt agtcgccata agcacaatca agaaatcatt 600 

ntgcaaatgg taagcttttc atgttgtgta ttctttccca tgatcagaga ggtaaaaagt 660 

gscctgggat gcataaagot gtcgtagacm tmtgcaaLct gccacatuat ctctttaaaa 720 

aaaaaaaara aaoactcgag ggggggnacg gaacccaggg eg 752 



<210> 63 
<211> 1094 
<212> DNA 

<213> Homo sapiens 



<400> 63 

teggcacgag gtcaatcaag tgaaatatca tgtaaactgt ccagcagctt tgaaagtaga 60 

gaatgaacaa ggcccctccc ccacccaccc tgtggaaagc ccgtct'ggtt tggtgtcctc * 120 

ctggacagcg tettgeeggt cacctttggc catctcccgg tgcgtggttc agatgtgggt ' 180 

cctgctttcc egccccctcc ctcctctgtg cc tgcctgcc tctg'ctgtgc egggecagtg 240 

ccttggtggc cagtggagtg gacaccagct gcgactgcgt gggaggggct ggcattgccg 300 

ctgccactgc agggcttggg eggctgacat gggacgaggc ttgeacaget gccagctcct 360 

gtctcgctga ctctttttat acagctttgt ctgggccacc gcctccagtg ccacgggccc 420 

cttgccgttc aggctgcccc tcatagatga acaaggccct gccccgcgtc cttacccttt 480 

agagctgttt aaattcaaat gaactgaaac tgaatargaa aaatccaggc cctcagccgc 540 

ccaggccatg f.r.tcaagtgc tccatggcca cacgtggctg gtggacagtg cagctctaga 600 

acattccatc accacagagg gttctgctgg acagtggcct tgggggctgt tttgagggtc 660 

cgcctgtcag tctcc.ggca tcaaagtcat tetgecattg tcaagttaca gttattttcc 720 

ttttacctcc aagecactat gtgcgtatgc tgccatgtgt ctgtatttcc egctaaaett 780 

cctgtcacgg agggaagggt gccacaggcc cagctcergg agggggtttg gatgtctggg 840 

tggggggaag ggtgccacag gcccagctcc eggagggggt ttggatgtct gggtgggggc 900 

cagggcgcca cagttccagc teceggaggg ggtttggatg teegggeggg gctttcttca 960 

tgttccatgt atgataaegg tgactggggg gtttacagag agaggcatac aaatagtgtt 1020 

ggagcgttgc .t^tegctaat ataaatgttt gaatagctaa aaaaaaaaaa aaaaaacr.cg 1080 

agggggggee eggt 1094 



<210> 64 

<211> 1361 

<212> DNA 

<213> Homo sapiens 



<400> 64 ' f . 

'cccgggccga cccacgcgtc cgaatttctt tagtgtattt tatgtacttt tgtattaatt 60 

acattgggta gactgttcca taaatgttga atagatccta tttgttggtt aatgctattg 120 

ttgtgtatcc tgctgatttt .ccgtgtagct ggtctatcag. ttgttgggag aagagtgttg 180 

aagtctacaa ctataattgt gtatttgtcc attactcctt tcagttcttt cagttccatt 240 

tcacatattt cccagctctt gattggtgca cattaatgat tactatgtct ccctggtgaa 300 

cagaccattt tataatttta tgatgtacct ctcttgtcac tttctr.tgct ctgaagtcta 360 
cctgatatta atatagtcac tccagcrttc tttggagtaa tgttagtgtg atatatcttt « 420 

tttegtcett ttacttttaa ccctatctat atcattarat Latcctaagt gagttttttg 480 

tagatagcac ttacttgtgc ccgtcacgtc tttaaatcca ctgtacaagt acaaatctgt 540 

ctttatttrt catttttatc cttattttga gacagagtct cactttgaca ctcagcctag 600 

aatgcagtgg tgtgatctcg gctcactgca acctctgcct ccgagggtca agcaatcctt 660 

tcacctcagc ctcccaagta gctgggacta caggcatgaa ccaccatgcc cggctaatLL 720 

cttttttttt cttttttgta gagatgggga tttaccatgt tgetgaggee ggccctgaac 780 

ccctgggctc aaacgacctg cccagctcag cctcccagag aaatatgatt ttaactagca 840 
catctagatc atttactttc aatggaatta ttgatgeatt acaattcaag tctgtcatct ■ 900 
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catttatcgc cttccatttt tccctccgtt cttcatttct gtgctttttt, rctttgcatg . 960 

taaacatgtt ttaggattct gttttgatgt atctagagta ttcrttagtg' tatcggttta 1020 

tgtagctttt ctagtggatc ctttaagtan tacattatat atacataact aagaagcata 1080 

tcagtgtcat cattttacca gttcaagLaa agtacagaaa tctttgtccc ctttacctgt 1140 

cctgcttata adgtaaacgt cttaaagact tccttgacac acatttagaa ccatatcaca 1200 

gtgttgtttt tgcttcagtc gtgaaacata atctagaaac tcatgagaat aaaaacccat 1260 

tatatttatx tataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa i320 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agggcggccg c ' 1361 



'<210> 65 
<211> 947 
<212> DNA 
<213> Homo sapiens 

<220> , '. 

<221> SITE 

<222> (67) 

<223> n equals a,t,g, or c 



<400> 65 

ctttctatag ttaaagccgg cacgcctgca ggtaccggtc cggaattccc gggtcgaccc 60 

' acgcgtacgg tttctgggtt ctaggccetg ctrccttgcc cctttgctgc agaagggcag 120 

ctgaaggctc accccagaaa ccgggccngg tgggtcttac ccggctcact ccptcccttg 180 

cccttacaca tacaggaaga caagacctga gtggtgctgt ctttgtgtcc gtcgtgtafcg 240 

gctctccctg tct-catctc ttcccactct gtcpctaaac ctccctctct ctcccttccc 3O0 

cctcagtacf. f.artctacag acctatgtgc gtgcycctat ccttctgtcc ttttctctct 360 

tcagctctcc ctgcctctca cacacaatt" 'tacatgcccc gaggagccaa gtztgggaca 420 

tttaccctcc aggcatctgt gtcccctctr. gaagagaaaa , cacacagctt cacacatcca 480 

ggcaxagggg gcaagctctt ggggcatcag gaccctggag caccaggtcc ttcctggaat 540 

attagatcca cctggagcac caggtctctc- taagtctcac ctggggaatt cggtcccacc ' 600 
4 tgggtcacca gttcccacct agagcaetgt gtcetgcect agagcacaaa gacctgctcc ' . 660 

tcccgagact ctctctgact gcagccaggc atagtaccct tgcctgtgtt tgctccctgg 720 

tccacagatt tggtggctgg gcaggtgcct ggacagtgat gaggtctcgc cgccttaact 780 

gtccccccca gtcacttctc ccacaggccc agcaggacgc agtcctgagg atcagggatt 840 
ctacagctgc attaaaatca atcctatcca aaaaaaaaaa aaaaagggcg, gccgctctag < 900 

aggatccaag cctacgtacg ( cgtgcatgcg acatcaagcc ctgaaga ' "■' 947 



<210> 66 

<211> 1376 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (18) 

<223>, n equals a ( t,g, or c 



<400> 66 , ' 

ggcgggcggc gcggaagngg cggckgcgcg gccggggcag ccatgtcgcc attgtctgcg 60 

gcgcgggcgg ccctgcgggt ctacgcggta ggcgccgcgg tgatcctggc gcagctgccg 120 /' 

cggcgctgcc gcgggggctit cctggagcca gtcytccccc cacgacctga ccgtgtcgct 180 

atagtgacgg gagggacaga tggcattggc tattctacag cgaagcacct ggcgagactt 240 
ggcatgcatg ttatcatagc tggaaataat gacagcaaag ccaaacaagt tgtaagcaaa ' 300 

ataaaagaag aaacctrtgaa cgacaaagtg gaatttttat actgtgactt ggcttccatg 360 

acttccatcc ggcagtttgt gcagaagctc aagatgaaga agattcctct ccatgtcctg 420 
atcaacaatg ctggggtgat gatggtccct cagaggaaaa ccagagatgg attcgaagaa , 480 
catttcggcc cgaactacct agggcacttc ctgctgacca accttctctt ggatacgctg • 540 
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aaagagtccg ggccccctgg ccacagtgcg agggcggtca ccgtctcccc tgccacccat 600 

tacgtcgctg agctgaacat ggatgacctt cagagcagtg cctgctactc accccacgca 660 

?cctacgccc agagcaagct ggcccttgtc ctgttcacct accacctcca gcggctgctg 720 

.gcggccgagg gaagccacgt gaccgccaac gtggtggacc ccggggtggt caacacggac ' 780 

scc-acaagc 'acgtgccctg ggccacccgc ctggcgaaga agcttctcgg ctggttgctt 840' 

ttcaagaccc ccgatgaagg agcgtggact tccatctacg cagcagtcac cccagagctg 900 

gaaggagttg gtggccgtta cctatacaac gagaaagaga ccaagtccct ccacgtcacc 960 

tiacaaccaga aactgcagca gcagctgtgg tctaagagtt gtgagatgac tggggtcctt 1020 

gatgtgaccc tgtgatatcc tgtcrcagga tagctgctgc cccaagaaac acattgcacc 1080 

tgccaatagc ctgtgggtcc g-gaagactg' cggtgtttga gtttctcaca" cccacctscc 1140 

cacagggctc tgtcctctag ttttgagaca gctgcctcaa cctctgcaga acttcaagaa 1200 

gccaaataaa cattttggag gataatcacc ccaagcggtc, ttcaaccata aactttgtga 1260 

ttccaaagtg cccagttgtc acaggcgcca caaataatta cattttccdd cataaaaaaa 1320 

aaaaaaaaaa' daaaaaasaa aaaaaaaaaa aaaaaaaaaa aaaaaagggc ggccgc 1376 

"' <210> 67 . • • ■ 

, <211> -2434 '. . 

<212>.. DNA 

<213> Homo sapiens . ,, 

<220> • . 

' <221> SITE ' , ' • ... 

<222> (10 )■ ■ " ■ ; 1 ' '■* 

> <223> n equals a,t,g, or c 

<220> . ' ' '• " 

<221> SITE ' , ' "' ' • . 

<222> (12) ^ ' ' , 

,<223> n equals a, eg, or c -T " ' * , r 

• <220> " ■ , * • * ■ ' ■ - * 

<221> SITE 

<222> (27) n •. , 

<223> n equals a,t,g, or c ,\ 

<220> r • "•' ' .. 

<221> SITE . / - '.. ■ . 4 /' 

<222> (73) • ' - . ' . ' . • ■ i 

<223> n equals a.t.g, or c . * 

<220> ' ' • . 

<221> SITE 

<222> ' (75) v ' ■ 

<223> n equals a,.t,g, or c 

<220> ' " . . ' 

<221> SITE , 
<222> (103) ' - , 

<223> n'eguais a.t.g, or c 

<220> ■; ;-' ■ * ' ' 

<221> SITE 
<222> ;130) 

<223> n equals a,t,g, or c 

<400> 67 : ' ' ' • ' 

ctgggggtan tneaagaace ctctgtngga cttaga'tgtc aagctctttc ctttgggcag ' 60 ■ 

cgtgtttc-tt. ctncncgagc agtgtgctgt gtaaactaaa ttngccggtt cgctttccat 120 
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rttcctgacan ttgagatgga atgccttgac ca::tggtgct "ccgacagaga agtcatggag < 180 

ccattgccac ttcctggttg cccttctgca atgtgaccct gttagtagag gttttctagc 240 

ttc^actaag atacttcttt ccctaaccat catacacttg gcatgtttca ttcccatctc 300 

ccrtcccctc accttaaagg agactacccc tttgccccat attgncaacc taattttctc 360 

tcgtactctc .tctagtgaat gatrgtgctac caagcatatg ccaggctgtg agaggaccat • 420 ■ 

actgagtagt agaaagaagc taatttgaaa taaaaattat ttgtataatt aagaaagcag 480 1 

attagatgca catggtcaac aggaagctga ctgtatgtct- gctagctaga ctcaaaacat 540 

cataaagacg atagcatgtc aatatattag cctagccatt atgttagcct ttgttaggcg 600 

ggcagctttt ctgcttcttc ccttcctctg tggtgacaac ggaggaaaza tccaacagaa 660 

atacgtcraa cagggaaatt gggaucacag tttatatgca tctgacttga aaggagtatt , 720 

gaggaaggtc ttcacacatg arictatcrtt ggatcaaaaa gaacatttat gaaatcaagc . 780 

cttctaacac tagttacaat tgagaagcaa cagtaactcc gtggacagca atcaagctta B40 

aaattgtaaa taaatatggg gataattcag ttgttgcaaa aaaagggcag aactcagtag 900 

aataaagtcc cttcctctta caggtattaa atgaggacag agaacctcag gtgttcctar 960 

' gctagtgcct gccgagtgca Lactaagaaa gcaatcccaa acagacgcat acatctagag ' 1020 : 

agagtggtat cagaga- tea gtgtatgtat ttatctacat ga'gaggaaac tggaatataa 1080 

■ tcccaiaaac tattggaata taatcccata aattatcacc ttttatgact ggaaaatatt 1140 

tgccaatgaa gaaatggtct gtaggtattt gtctcaagac Ltttggccgt ttaataaaaa 1200 

tgtaacttta aeggtttett atagttgect trataaagtg tattgtciaa t aatatttLLg .1260 

tatcatgtgc ctttigaaact tgacagctga tttgggtgtt ggatttctgc ccagccattt 1320 

atcagtatta tcattttat t cagtagctgg caggtgtatt agacaaacga gacttaggta 1380 

aggaatggaa cctttcctgc qgtttgactg cacatcacac cagaagactc cagtatccct 1440 

cattccagaa tgagg'aaaaa gtattctaca aagaacctaa ccacctctgt gaaatctatg 1500 

ggatggaaac agtgtggcct. taggagtcaa atagtctctg catggcgggg aggatcatga . 1560 - 

tggaatatgt gaatttctac c tctagaagt . tgtgaaatag gtcctgcact ttzgcagaat 1620 ' 

gtccctcttt aaacctggc^ tattccacag ctgtagrtga taacatgacc eggggcttag 1680 

ctgctctagc cetgggttbt tcrgagacctc acactgcctg. gcccctggcc atccaccr.aa 1740 

ggactgcctg ctttctggtc acatgtggac cttgatacga ecaageggtt acatatgtgg V 1800 

ttgtgcaaaa gcrtcctgtt taatgeatag tgttaccgot . t taca tcttg gttttcagtg ' 1860 - 

gcactatgtc .taggaggcaa tatcctttta aacagtget- tggctaagac agatacttgt 1920 

gaatcaaaga cagcacagaa aCgaactaag tatatccca- ttggaattat attttgatac 1980 

tatttaaaat ggtttcacct gttaaagggc caacagaact cttggtttta cttttgtaat 2040 

tactgtacag aaaatttcaa gagtgtttga gtgcttgtca tcaggtgttt tccttaataa 2100 

gcagggatat gaccartcac aggaattata catgaaaaaa gnttttgaaa tgtatttttg 2160, 

tgatgtgcta tgttgagggg aaaccaaata trtatgattt taaaacattc gtacgaaaac 2220 

attgtacaac gtaatatget caactttctc aattctttgc taatttttct aagatacatt -2280 

aaaaatgttt tatatttttt tttaagtaaa atggacccag taagaaaatt aaaaatacca 2340 

gaacataaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2400 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa "2434'. 



<2i0> 68 ' 
<211> 1086 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> SITR 
<222> (10) 

<223> ti equals a,t,g, 
<220> 

<221> SITE 
<222> <77) 

<223> n equals a,t,g, 

<220> ' 
<221> SITE 
<222> (1056) 



or c 



or c 
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<223> n equals a.t.g, or c 



<400> 68 . 1 . 

ttga'aacacn ctcttgagca ttaggtccca gciccaccgc ggtggcggcc gctctagaac 60 

tagtggatcc cccgggntgc akkaattcgg cacgagcaac ctgtagtgag ctggctgtca 120 

ctcagcagct ctggaaaatt acctgtctgc actgaatttc ctcatcagta aaatggaaat 180 

gattatagta ccgaccttgt aggattatrc taaaaatcag agaagttcat gcagcttaga, 240 

acagtgccag gcacar.ggna aatgctatgg catacttaag catcttcctc tgtggtgcct 300 

cgtcatcacc atgtgattgt gccttgcttg tccctg^ttc acttttcaga gggagaaaag 360 

tggcccactr caagaatcaa aattctgatg ttacttcggg aaatgcatag agccagagag 420 

acacaacttg acntagtacg atccacatca tcccctcagg ctgaatagtg gtggcatgca 480 

catctatacc aaaatgtttt accttttttg tagaaggaaa atatttgtat cttctattcc 540 

atatcttaga tctttataag agcacttaac tccaacctcc taagaaactg ccaattttgt 600 ' 

tgatcatgat agcctgcaca gafcttecgta ctatctagtg ktg'ggagtgc cctagggacc 660 

atcaacaaca ggsccctcct cttatccacg" agactactga ggccttggag gctatttgtc. 720 

catccatggc gggtgcacrg ccagaggtta tctggttagt agcccaacta agattagaac 780 ' 

ccaggaattc tgatttaatc tcaaatgacc tcttttattt ctgcatccgg gaaggagaag 840 

aggaaagtaa acgggaaatt catgttatct atggaaaagt tatcagcttg atgtrtatta 900 . 

aatgatttgt ttcaggagat tgnttacoac ttttatcttc ctagacaata attLUctgta £60 
agagtaaagc catggtcatt aaggtagtca tattaatgca ctcagtaasc tgtgaagaaa . 1020 

aacaratatc aatgttttcc aacaaaatac agcgancacc rgaaaaaaaa aaaaaaaaaa 1080. 

aaaaaa L086 



<210> 69 , 
<211> 1262 • ■ 

<212> DNA • '• - t 

<213> Homo sapiens . 

<220> 

<221> SITE 
<222> (568) 

<223> n equals a,c,g, or c . r 

<220> ' ' 

<221> SITE 
<222> (639) 

<223> n equals a,c,g, or c 
<400> 69 i. , 

cctyctatag. gkmaagctgg tackcctgca ggtaccggtc cggaattccc gggtcgaccc 
acgcgtccgc ttacaacttc tctyttgttt ctgtgttt.ct tccacaccat tatggcgtat 
ttactgatct acaaccggtc tt'aagctagt tgcctgcrta aaattttctc atgaattctc 
attacatttt caaacaaaat ccaaagttct taccatgacc' ttgtggaLat ctttctgacc 
tcatttacta ctgttacctt ccttattcat tccaitccac ccacagtggc cttcttgcaa 
gtcagagaat gcaacaaata tgctcctgcc ttggggcatt tgcacttctg ttcttttggc 
ctggtcactt cacctccaca ttctccattt t-tatgactt tcttcctata cttggatctc 
tgttcaaatg tcacccctca aaaaggcctt caaaactacc ctatttaaaa cagcacctgc 
cactgccacc tccataccca ttttcatttt ■ tccatagcac ttatccctac tcggcactac 
gttatatatt tgtxtattcg Icttaccgnct 'gtctccccca ctggaccgta agctccatga 
ctgtaagtac atagatgaat tacaaaatga atcaatggng aawtatcctt gmcatatatt 
atttaatgat . ttcgmctcca r.r.r.r.aagtaa aaaaaaaata ccattttttt cactcctcaa 
agtgatatag tt^aatcttc' taaactacat ttttctcatt tcccgattta attaaatcag 
tgatataaaa aaaagagcga tggggatatg , tgaaagaaga ccaaaataga tgccaggaaa 
tactaaaact gctgaagttt agtggtatat tttttcttta* cacacagrat tatttgagtt 
actaattgtg tcactgaact acaqaataag caaaatacta ggtaaacaga atcaergcttg 
ggggctatat txtgtgtaaa aCttgtgtta tgcaaaaata atattaaata tttaattact 
acagttttgt tatctctttc -tattttagg aaatgatttg cagctgagtg aatcaggaag 
tgacagtgat gactgaagaa aLatyLagct ataaataaaa acttauacag cacgtataat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
. 720 
780 
840 
900 
960 
1020 
1080 
1140 
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ttatttcgta tcaacaacaa' aaactcctaa gactgaggga aatacgtcct aacttttgat 1200 * 

gataaaagaa attaaatttg atccagaaat ttcaaaaaaa aaaaaaaaaa aagggoggcc 1260 
gc ' / . '; . 1262 

<210> 70' ' .• 

<211> 1642 

<212> DNA 

<213> Homo 'sapiens 

<400> 70 , . 

ggcgcgctgc ccggarctgc ctgggttgcg ctgccggcca cgtccccgcgr ccgggcctca 60 

ggctccttcc tactgtccga gggccaccag gccgccgggg gcctgctgcg cccggatgcg 120 

tctgtcacta gagtggagag tctaccttcg Lctcacatgt gccacaaagg atggcatggc 180 

ccgggagtgc cccaccacgt ggct-'tcacc ccctgcaaag ccagacttcg 'cccagcgaca 240 

cagtgtcaag cccacagctc tccaaggagg aagatggrcc aggctgggag catcycctta 300 

gcagcagccc ctgatccctt ggccaagcag gagggaacca ttagcagcct gaggagctgg 360 

ctggctggga gcctcgggga ccgcccagcc ttgctcccag ctcacccaca agatgtggac 420 

agctcttgtg ctcatttgga tttcytcctt gtccttatct gaaagccatg cggcatccaa -480 

cgatccacgc aactttgtcc cnaacaaaac gcggaaggga ttagtcaaga ggaatgcatc -540 

tgtggaaaca gttgataata aaacgtctga ggatgtaacc atggcagcag cttctcctgt . 600 

cacattgacc aaagggattc ggcagcccam ctcaactcta tggaagtcrac aacagaggac 660 

acaagcagga cagatgtgag tgaaccagca act:caggag gtgcagctga tggtgtgacc 720. 

tccattgctc ccacggctgt ggcctccagt acgactgcgc cctccattac gactgcggcc 780 . 

tccagtatga ctgtggcctc cagrgctccc acgactgcag cctccagtac aactgtggcc 840 

cccattgctc ccacgaccac agcctceagt atgactgcgg cctccagcac tcccatgaca 900 

cttgcactcc ccgcgcccac qcccactcyc acagggcgga ccccgtccac raergccact 960 

gggcatccat ctctcagcac agccctcgca caagtgccaa agagcagcgc gttgccaaga 1020 
acagcaaccc tggccacatt ggccacacgt gctcagactg cagcgaccac agcaaacaca ^ 1080 

agcagcccca tgagcactcg tccaagtcct tccaagcaca tgcccagtga caccgcggca 1140 

agccctgtac cccctatgck tccccaagca caaggcccca ttagccaggt gtcagtggac L200 
cagcctgtgg ttaacacaac awataiaatcc acamccatgc cctcaaacac aaccmcwgag '• 1260 

cccctcaccc aggccgtggt agacaaaact ctccttctgg tggcgctgtt actcggggtg 1320 

accctt-tca tcacagtctt ggttttgrtt gccctgcagg cctatgagag ctacaagaag 1380 

aaggactaca cccaggtgga ctactcaatc aacgggatgt acgcggacLc agaaatgtga 1 1440 

ggggggcggg ggcctggcgg gaggcctggc cccttcctcg tcctrtcctt ttgcctttga 1500 

gaccaaacca agtgcttcca aattcttttg gtgcaattga ggagatatgc cagatgdtta 1560 

aacacat-ta atf.gctgtca gatraattcc atgatcacta aagagctgct gcttttttca " 1620 

taaaaaaaaa aaaadaaaag gg 1642 



<210> 71 • 

<211> '921 

<212> DNA 

<213> Homo sapiens 

' <220> 
<221> SITE 
<222> (4) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE ', . 

<222> (9) 

<223> n equals a, t,g, or c 

r ■ 

<220> 

<221> SITE 
<222> (11) 
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<223> n equals a, t,g, or c 
<220> 

<221> SITE 
<222> (15) 

<223> n equals a ( t,g ( ' or c 
<220> 

<221> SITE 
<222> (20) 

<223> n equals a,t,g', or c 

<220> . 
<221> SITE 
<222> (901) 

<223> n equals a, eg, or c 



<400> 71, ; 

gcgnggggna r.aggnaagcn ccceactatt gggtccaaaa gctggagctc caccgcggtg 60 
gcggccgctc tagaactagt ggatcccccg ggctgcagga atteggcacg aggtctgago , V .120' 

agataagatt aagggctggg tctgtgctca attaactcct gegggcaegg gggctgggaa 180 

gagcaaagtc ageggtgect acagtcagca ccatgctggg cctgccgtgg aagggaggtc 240 

tgtcctgggc getgetgetg cttctcttag gctcccagat cctgctgatc tatgectgge 300 

atttccacga gcaaagggac tgtgatgaac acaatgtcat ggctcgttac ctccctgcca 360 

cagtggagtt tgctgtccac acattcaacc aacagagcaa ggactactat gcctacagac 420 

tggggcacat cttgaattcc eggaaggage aggtggagtc caagactgta ctctcaatgg 480 

agetactget ggggagaact aggtgtggga aatttgaaga cgacattgac aactgecatt S40 

tccaagaaag cacagagctg aacaatactt tcacccgctt cttcaccatc agcaccaggc 500 

cctggatgac tcagttcagc ctcctgaaca agacctgett ggagggattc cactgagtga 560 

aacccactca caggcttgzc -catgtgctgc tcccacattc cgtggacatc agcactactc 720 

tyctgaggac tcttcagtgg ctgagcagct ttggacttgt ttgttatcct attttgeatg 780 

tgtttgagat ctcagatcag tgttttagaa aatccacaca tettgagect aatcatgtag 840 

tgtagatcat taaacatcag cattttaaga aaaaaaaaaa aaaaaaarct cgaggggggg 900 

nccggtaccc agggeggaag a 921 



,<210> 72 

<211> 906 . " , 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (34) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (833) 
<223> n equals a,t,g, 

<400> 72 

ggaaattctc cctcactaat tggaacaaaa gctngagctc caccgcggtg gcggccgctc 60 

tagaactagt ggatcccccg ggctgcagga atteggcacg agggaagaga gaggggaggg 120 

tgagcagagg acaggceggg agttttccgg gaacggagga agagcagtgg aggztgccag 180 

gatgaggctg ctgtgtggcc tgtggctgtg gctctccttg ctgaaagtcc tgcaggccca 240 

gaccccaacc -cccccgccac tcccgccccc gatgeagage ttccaaggaa accagttcca 300 

9ffSrggaatgg ttcgtcctgg gcctggcggg caacagcttc aggceggage acagggeget 360 

getgaacget ttcaccgcaa cttccgagct aagtgatgat ggccgctttg aygcgtggaa 420 



or c 
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tgcgatgact cgaggccagc actgtgacac atggtcttat gtgctgatac cggcagccca 480 

gcctgggcag ttcactgtgg accacggtgt gggcaegacc tggctgcrgc ctcccgggac 540 

gctggaccag ttcatctgcc tgggcagagc tcarggcctc tcggatgaca acawtgtctt 600 

cccagatgr.g actggargtg ccctggacct carcagcctg ecctgggtgg cagccccagc 660 

ctgaccactc agacagccgc ggcccccaag gcctgactct ' tcttgtggga gggcgaggct 720 

ggtcacccca ggccagcgtc tgttgaagga tgaagcagct cctgtccggc ccagccctgc 780 

ctcacagccg tgcgagctct gccctcctca gctctcaaac ctgaacaaat gcnccaagcc 

cagaaaaaaa aaaaaaaaaa aaaaaaaaaa ctcgaggggg ggcccggcac ccaattcgga 
agatr.g ' 



840 
900 
906 



t <210> 73 

* <211> 580 ' 

<212> DMA 

<213> Homo sapiens 

<220> : 

<221> SITE 

<222* .£7) . 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (9) 

<223> n equals a,t,.g, or c 
<220> 

<221> SITE 

<222> {15) 

<223> n equals a,t,g, or c 

<220> " ♦ 

<221> SITE 

<222> (16) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (22) 

<223> n equals a,t F g, or c 



<40Q> 73 • 

cactcantng aacannagct cnagctccac cgcggtggcg gccgctctag aactagtgga 60 

tcccccgggc tgcaggaatt cggcacgaga tatttcgctg ,gaccc: tagaa aagccaccac 120 

gacctgtggg ccatgatgct accccaacgg ctgctgctgc tgttccttct - cttcttcttt 180 

ccctcccrcc tcaccagggg ctcacGttct ccaacaaaac acaacctttt ggagctcaag 240 

gagtcttgca cccggaacca ggactgcgag actggctgct gccaacgcgc tccagacaat 300 

tgcgagtcgc acr.gcgcgga gaaggggtcc gagggcagtc tgtgtcaaac gcaggtgttc 360 

cttggccagt atagagcgtg tccctgcctg cggaacctga ctLgcatata tccaaagaat, 420 

gogaaatggc ttagcatcgc ctatggccgt tgtcagaaaa ttggaaggca gaagttggct 480 

aagaaaatgt tcttctagtg ctccctcctt cttgctgsct cctcctycty. cacctgctct 540 

cctccctacc cagagc-ctg zgktcar.cc.t gttccccaga gcctccacca tgagtggagg ' 600 

gaagtgggga gtgattgaaa taaagagctt tLtcaatgaa aaaaaaaaaa 1 aaaaaaaaaa 660 
aaactcgagg gggggcccgg ' ; " , ■ 68 0 



<210> 74 
<211>' 1633 
<-212> DMA 
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<213> Homo sapiens / , 



<400> 74 

ggcacgagca ccagtaagta ggagctaggg aagagaaagc acgcaaaagg gcaagaatca. 60 

ggagaactga ^a-agtagct ggacagaact ctagatgtgt gtgtgtgaga gaaagagagg 120 

cagggagaag gagggaggag ttacccccab gatgacctcc aacctccctt tctgcaccct 180 

canccrgggg atagcacagg ctcaggcctg ccctggttgc cctggcgatt ggcctggcct 240 

gggctcaggg gtgggggagg ggctgcacca cattaggacc tgccgtactc caatcccatg 300 

cagtcctcct. gctcctgctg ctgcgtgcct gggctctggt cacgccaggc ttcctzgtgt 360 

cctgcgtctg tggccggttc ctgccaacc. gtctagtcct ttcaggcttg aggccctgca 420 
Ctgctctttc- Lggtcccctc ccctccttcc cgctccccat ctagcctttc ctgggttccg ' 480 

ggatctgctg acagactttc ttcttgccgc ctgcctgctc acatttcaga ' agacccctct 540 
ggaactgccc arggctgtgg tccacccgct ggtiagcaacg ccccgttacc aaatgctaga / GOO 

taatctgcca cccccctctg cagccgccaa ctggtgctga gctgccacct gttgtctgtt 560 

tccacaccct tgtacactgc actctgcctg Lc;ac;tccgga cagctgccca gcagctgcag 720. 
ctgaggccgt cctgccagca cctaccgcag ccccatgggt ggtcctgtcc agttcctggc . 780 

gctcccagcc tgcctcatct gctg'tactta cagagccaca tctggttgaa tacagagctg 840 

gggtagcctg gacgggccca tggcacactc actcccagag aggcgggcat cctctctctg ,900 

' gacrrt-.ctagg gaagggcagg gtctggagcc caataacaaa acacagtaat gataatcata . 960 

gctaatgttt accgagaact taggatgtga taggctcaat gatttaccag aattactcta -1020 
cittgcacaa ccaccccatc aggtaagcac tattaaca'tc ccctttcc'ca catgaagaaa '" ' 1080 

ctaaggctaa ccaatagaaa tgaacctgtcr taatgtccca tgggcataat t-.catggagcc 1140 

aggattcaga accaggcgct ttacttcagg gctgaagctc ataaccacta gaccaaagcg 1200 

cctcctccgt gacctctggg gaaggcccag caacaccctc tattgcgtcc aatgacttcx 1260 

ccctggtctg agatccactg actcacaggg gtagggttaa ggttaaggcc agagtctcag 1320 

c:aagctttg gatactttct tctggattcg gagaaggctg gaataactga atttctgctc ■■ 1380. 

atcttcagga gcggcctacc agagccacac atttcccagc tctccttgtg tgr.ttcccag 1440 ' 

acccttcttc catcgtgtcc tcicccttag gctcctggaa agttttcaga gagaatcacc 1500 

cagtggtaac at tgttaaac " aaaacaggaa aatgggactt gtgtgtatat atgattaaat 1560 

tattaattga tggatctacc ttcttagctc gtgccgaact cgatatcaag cttatcgata 1620 

ccgtcgacct cga \ 1633 



< 2 10> 75 

<211> 1022 

<212> DNA , 

<213> Homo sapiens' 



<400> 75 . • ; • 

ttcccgggtc gacccacgcg tccgcccacg cgtccggctc ggggccagca- ccctgtc'tca , 60 

aagatggcaa aacgaggcta- gttctggatg agccagctgg tgtgggttcc aaccatagga . 120 

acacactgat gctcaaatc- taaggtgcca agctctaggc cctggaggct ggtagaacag 180 

gatctatgcc tggaatcctg gcagggattc ctgtcaagga cttgtgttta agcctgcttc 240 

agggcctcag gctgcttctg ctctgtgtct gcccaggctg gctgagcggg tggatgggcg 300 

gacagaaggg ctcaccaagg attgtggaca tagggtaggc cctggtacca cgggtttcag 360 

gccgccatca cttcccttgt aggaacatag ccagaagcag atgagccagg gtagagggct 420 

ggcccctcct ctcatc.tcc cttcagtctt aaattgtctc cagcgatggg aagaggccag 480 

ggactgtaac ccttgtgr.r.g tgtattctct gagcccctgc tcactctcag ggccaagcag 540 

ctcccaagcc ggggccctct cctggccaaa atctgaggag cagtctaggt tacaggcttt 600 

ttggtaggta ggttctggcc gcctgttaac gcagttaggc cccctgatta ggtacactga 660 

gaaacaagct agaacaaccc tggcccagaa gactgtgcac tccagcaaga- tccagggatg . 720 

atagccttgc agggccaccg ggagtttgtg cccaagr.txc tccct'cttct' ctccccaggg 780 

ggcactggga ctggtccctg ccctcatcct tagcctgggc cttccccaga cgtat"-aaag 840 

agaagzatga ttcctctgtc ttcagczctc ttcaggggca tcctgcccat agcacccagt 900 

tcccaagggg cccccagcca cgcggtgaag cctagcactc atgcagcrct tagggaacca 960 

aaaaccagca ctgaaataaa .gctgaatgac tgaccaaaaa aaaaaaaaaa aaagggcggc - 1020 

c * " 1022 
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<210> 76 
<211> 1184 
<212> DNA 

<213> Homo sapiens 



<400> 76 

agcaaaaggt gaagagagaa aggaagcttt ttctctaaaa atggtgcagc tatcctctga 60 

accaatctcc ttcggtttaa tqtacctgca tctcggggtt ttcttccact taatttatcc 120 

tggagctctt tccataacaa cacttggaaa gcactctcat ccttttttca ctgctgaaca 180 

gaattccact gtgtggatgg aacatactct atttcaccag tcccctgtag ccagtcactt ' 240 

ggtttgtttc caotcttttg ctut utcaga ' gcaataacct tgtacgtcta tcattttgta 300 

tgcatacagg tttatatgta ggaaaaatt™ ^tagagtagg attgccggac caatggataa 360 
aagtacattg tggacagaca atgccaaatt gcctttcaga gaccgtggcc ctgtgcaccc ' [ 420 

catcaggcat gtgrgactac caaagcccct gLcagctgtt t tattttatc tcctttccag 480 

tctcaggctc aatgcagaac tttgaggtaa gcttrtccoa alatgtaggct: cctaaacgcc 540 
acagccagct ctgccacatg aaggagagc- caaacgagac agaaacagcc tccgggcagg ; 600 

atttctatcc tgcacagata tattttccac attctgggaa accgtgaagc ttccagagcc 660 

acaatccccc agaaacacat ccccctgctg gtacagccaa gccccagaac aagctgtgct, 720 

tgcctggcac ctcaaagcca accaccatgg atgccacttg ccatgggtgc ctgcaatctc ' - 780 

aaataatgag aaataagaaa cttcagcttc tcagtccctc tagccaacat ttcaggtgca - 840 

tgacdgt-ctc aggcggcaag cage tac tec geeggacagg gcagaagatg gaacacccca ' 900 

cccctgggga agatcacgag gtcaggagat caagaccatc ctggctaaca cagcgaaacc 960 

ccgtctctac tgaaaataca aaacattagc tgggtgtggt ggcaggcacc tgtagtcccar 1020 ■ ' 

gctactcagg aggctgaggc aggagaatgg cgtgaacctg ggaggtggag gttgcagtga 1060 

gecaagattg cgccactgca, ctccagcctg gcaacagagt aagactccat aCcaaaaaaa 1140 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa - 1184 



<210> 77 
<211> 312 
<212> DNA 

<213> Homo sapiens 



<4C2> 77 *. ' ' . 

aattcccggg tcgacccacg cgtccgtgat gagtggattt gtactcttac ccaggtcctg 60 

ngggccagcc cacccagcac ccccaccccc gatgaegctg tcccracaac tggctgaact 120 

ggtgcatttt gtgtgtgcct tccagagcca gtggactggt gtgtatccaa tgatgccacc 180 

tctgaaacct ■ acagaaccac tacgccttgc atgtgcaccc tgcagggnct gagggecagg - 240 

ctgtctggta gctctgctcc tgggtgacag agcaagactc tgtctcaaaa aaaaaaaaaa 300 

agggeggecg ct 3 12 



<210> 78 
<211> 1370 
<212> DNA 

<213;» Homo sapiens 



<400> 78 

tggctaaaga' gtacgacatt ttagtcaggg ctgagcacag cgcczggccg aggagtgett , 60 

ttatatgece ttggaag^at atggggactc cattcctgnc actgeggega ttccacttat 120 

gcatccacca cgccaaccca aggcagctga tggct^agga aagecagaga ctgagatgtc 180 

aaaatccttg gggctatcta ccaacatgtc tccatcccac ctgttagggt taaaggr.ttt 240 

cctaacctgg gccctgacct tagcacagat ctgcctacat ctttttgaag ttcacccac- \ 300 

tggactatta gccctaaact tcttctgtac tgccactgca gggctgaagg agc^tcgcaL 360 

gcacccacca agtctggctt tcacaccuga atttcacacc tcgccttcac cctzagctat * , 420 

t'ccat'ccttc tgtggaacat cagtgtcact tagcaacagc catacaaccc catcatcctt 480> 

atacctacct .ttcccttcaa agtc-eggat gectgataca ctgcacctgc cagcgcattc 540' 

actcccatta gtacaccccc aagtccttcc agegaaagat gtaacaattg astggecact »f 600 ^ 

gtgecaaagg ugcctgggct ctacccgcca ccagtgatgg ggtccccaca gccaacccag 660 
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tggtgtatca tccatttcca ddaccatc'tc amtgcctgtt ttcttggacc actcctggca 

ccaactgttz caggttagag tgactgaaaa tttgggktat a'agatatcta tcagagatca 780 

atatctatita aaaatgtgaa aggaagcagg at'cktgctga ggaagaagat aacaacaaag 640 

ataccacaat gr.cagcccat caaagccttt ggccaaccca gcasaaaaat ctggagcagg 900 

tgtayctttt tcagagtctc cccagttagg tcaaaatgtg cgcctttaca cccgcacctc . 960 

cctcaaccac gggcttcagg ccgccctggg cacgacctca gatgaagcgg ctcacacagc 1Q20 

cgaggctaac gttgtcggac ctgacagctg aaagccgttt gccgaccaca ctcccacagc' 1080 

cgagcagcat gcccttgctt ggaggaggat ttggatgaca cagctctatg tctgccgtat 1140 

tttgggggtc acatccctca cacccagctc tgcctcaaa: ggcaacatrg acaaaaattc 1200 

attccagttg caaaccaact gtagaacact ataaaaccag agtacaagtc t'aatagcata 1260 

aatctcac.cc xaagtggaaa agaaagggtc atttattcac acatttttgg ggaaaaaaaa , ' 1320 

acaacctt-g ctatgtcttt attacaacac ggacactcac aaaaaatagt 1370 



720 



<210> 79 ■ , : . ", ' . ' . ; 

<211> 368 . . " » 

<212> DNA : * - ' • • 

<213> Homo sapiens > 

<220> . V ' ' t , ; : ' 

<221> SITE * * ' 

<222> (5) , ; 

<223> n equals a,t,g. or c .. 

<220> 

<221> SITE.:' V - 

<222> (13) . 

<223> n equals a.tvg*. or c ( . 
<220> 

<221> site *:'■' . - ■ 

<222> (35)' ! > 

,<223> n equals a,t,g, or c 

<220> , 
<221> SITE 

<222> {351) "< 

<223> n equals a.t.g, or c . ; . 

<220> ; n ;■ 

<221> SITE ; : 

<222> (367) v_ • . ■ 

<223> n equals a.z.g. or c 

* *400> 79 .*'"'*''. ' , 

aatgnaactc tcnctactaa ttggaaccaa agccngagct ccaccgcggt ggcggccgct 60 

ctagaactag tggatccccc gggctgcagg aattcggcac gaggtccagg tgtctttatt 120 

tcagatgttc tgttr.r.r.rcfc ctattttctg ctcacatgaa catacacact tgccaggcac 180 

attctggctt ttcttatttc tccttcegac tttgcctcct tcctgcccct gttttcttcc 240 

cttttctctg gctatagaaa ctgccaggtg gccttgctgg catcatccta ccccttttga ' ■ 300 

accgtgccac ccaggaactt ccatttatta tgcctcgagg gggggcccgg nacccaattc : 360 

ggaagarit , . - . 36 g 

<2L0> 30 '■■ ' ■ . . ' , .. 

<211> 1088 

<212>;-'ONA . * . ^ • 

<213> Homo sapiens 
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<220> 

<22l>' SITE 
<222> £1) 

<22 3> n equals a.fc ,g. or c 
<220> 

<221> SITE 
<222> {A) 

<223> n equals a.z.g, or c ! 
<220> 

<221> SITE ,• .* 
<222> (5) ' 1 

<223> n equals a, t,g, ' or c 

<220> 

<221> SITE 
<222> (?) 

<223> n equals a,t,g ; or c 

«220>- 

<221> SITE 

<222> (11) '•■ 

<223> n equals a, t ( g, or c 



<400> 80 

' nttnnaacng naaatctccc cttactattg ggaacaaaag ctggagctcc accgcggtgg v 60 

cggccgctct agaactagtg gatcccccgg gctgcaggaa ttcggcacga gattccccag ' 120 

aattggttca tcttgggaaa ggaaggccta ggaagctgat gacctacttg ttttgttcat ' 180 

ccatttcnct actgcttctt aaagtgcact caagcggtca ccaggacatc agaaaagcca 240 

. aatccaaggt cccaaggctc ttgatcatcc agtgcccaca acagagggag taagctagac 300 

; ggagaagctg aagtgaagga ggaaagagag aagggggtgc ctgttccacc cccttcctag 360 

cagctgcctc tgtgggctcc ataatcccca tgccczctcc cccaccccac cccccggccg 420 

mc;cacct:ccc ccccacccac cccattcatc agtaggaggg cactgtatca gcacagcctt 480 

tcaaggcctg acagttcctt tctgggtg'cc ccccacatgt aaccacttaa cccccagcac 540 

aaagggccag cctgaatggg cttggagccg gtggctcgtg cgraggggga ggcctcaata 600 

ggatttgggg agctgggtgt ataaaccgct ctgctccagc agccctcaac ggaggagctg 660 

gagttggggc atgtgccgag gggaggagct rrggcagctgc accccgsccc cgaacaatga 720 

tztattcat't ctcctggcac gcacagagaa gacaggctgg ggaaacatgg ccccacaaat* 7 80 

cgccctcagc ctcagctgat tcccacggca atcattgcag tccagagtgc ccccccccac 840 

ttgtaatgtt ttgcctcaag cttgcccttc ctcctcctgg taaggaagta gactacatta 900 

cgggaggcct taagtagcct taaattcagg caagcatcgt tctttgagtt ggttgcaccc 960 

aaactcaatic^ tcactctctc aaagctttct ggccaggccc tcctcacaca cccatagaaa 1020, 

cragatgcaga caaaaaaaaa aaaaaaaaaa actcgagggg gggcccggaa cccaatLegg 1080 

aagatttg l08 8 



<210> 31 

<211> 1B62 ' • ' 

<212> DNA 

' <213> Homo sapiens 



<400> 81 

ggcacgagct ctgcaaagct ccctctgccc cagcacgtcc LgtaLLtttc caact-trgga 60 

cgtttgcaca agctattatc ttttgcaaga atgtcctgcc ccctcctatc catctgcgga ' 120 

agtcaacctt cctttagggc ccagaggaaa acctatcacc ttctacacct ccccagatgc 180 

ccaccatt-g ggtcaaactc cgccccctcc aggtttgcca tggccctttc cctctcttaa 240 

agcactggtc ccagcc-atg ccattgtgtg tcaccctggc tcctgtctcc tactggccgt 300 

agatcttLgg gargggagga accggggaat tttcaccctg gggctauggg acagtgctca 360 
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cccctaagta caaaataagc aagaatcctt gaattgaaaa gctccagtga cttgatagct 420 

tccaacagcc cctaaagatg gggtag-cca ctgctcccta tataggaagg aagaggttca 480 

cagtczcacg gatttagcag ccttcgtagc cccaggaccc t tcccaatigt ecccagttct 540 

cctgctgggc cagctgtggt ctctgagaa- gctcagttcc ctgggctagg ctggctcacg 600 
gggccctccc tcctggacct gggtccatg- gaccaagcgg ccagccagcg agagagccct " 660 

gggtgatgag catccggaag tcagcttgcr tgtctgccct aatcacagcc ctgggctcta 720 

..ccctcaagta ccctactcac atggtccaag agggaaccac tcttctgtca ttccctcccc 780 

ttctgcnccr catcctaccc acccagctgt tctgggcaag atcctgtgggcatgatggaa B40 

tcctcccttt tctctaccat ccancccttt gatcacceaa tcctrcarct gccctccaac 900 

caaggtgttc tttccatacc aaaaaatctg atcaggccar .cactcctc'tg tttaaaacca 960 

ttcaacggct acccactgcc ctcaggataa aaactaaaat tctcaacagg atttacaaat 1020 

tcttcacctg gcccttttga gacaagagga tttataatct tttcaaagcc aacaagcatc 1080 

ctgagacaat cttaatttnt gcagctgtcc aagctattta agattgatga ggtctgcamt 1140 
aagttgttaa atggtgatta atttatctgg ccrgaatggc akgtgaattc caaggggatc : 1200 

atttttctct accLLccccg ccgcacactg ggacacagca aggccaggac acccttycta 1260' 

ggtgactgsg taaccaggtt cactaaggca gtcagcgcag agaagaagca gaaaatgggg 1320 

aggagttgtg taaatgaaat tgccgataac accaaaggaa ccccaagaga ggtteagtct 1380 

cccaacaatt gcagctttgc tgcaacatgg cagggtggga caggggtagg tttggatctg 1440 

ggctgctttg ccmtiggaagr ggggttacag ggcagggaga ygcttaccat cactgagcat -1500 

ctttaattga ct ttgtcatn ttastgagaa cttacaaatt gagctattct tcattgagtt 1560 

a-atctggaa gcaggrtcaa tacacaatac agtcc-ccct ctagaatttc tgctttatgt 1620 

ccacgtgacg cagcttgttc atctccgaat tccttttaat gacaaaaaca r cttggaagga 1680 

-atgtaaagac tcaaaaaggg garggcatgt aaactatagt tggaacaata tgtggccatt . 1740 

taaaagaagt gttgtccagg aggtggaggt tgcagtgagc tgagattgca ccactgcacc 1800 

ctagcctggg ccatagaatg agactccttc tcaaaaaaaa aaaagaaaaa aaaaaactcg I860 

a 9 ,.■ , 1862 



<210> 82 

<211> 1618 

<212> DNA 

<213> Homo sapiens 



<400> «2 

ccacgcgtcc gcaaggagcc agaggccacg cagtggctca gggtccgtga gccgcccggg 60 

gaggccacag gacacagggt caccatgggg acagccgccc tgggtcccgt ctgggcagcg 120 

ctcctgctct Ctctcctgat gtgtgagacc cctatggtgg agetcacctt tgacagagct 180 

gtggccagcg actgccaacg gtgctgtgac tctgaggacc ccctggatcc -gcccatgta 240 

ccctcagcct cttcctccgg ccgcccccac gccctgcctg agatcagacc ccacactaa. ■ 300 

accaccatcc tgaagggtga caaaggggac ccaggcccaa tgggcctgcc agggtacatg 360 

ggcagggagg gtccccaagg ggagcctggc cctcagggca gcaagggtga caagggggag 420 

atgggcagcc ccggcgcccc gtgccagaag cgcttcttcg ccttctcagt gggccgcaag 4B0 

acggccctgc acagcggcga ggacttccag acgctgctct tcgaaagggt ctttgtgaac 540 

'cttgatgggt gctttgacac ggc'gaccggc cagttcgctg ctcccctgcg cggcatctac \ . 600 

ctcttcagcc tcaatg-gca cagctiggaac Cacaaggaga cgtacgtgca cattatgcat 660 

aaccagaaag aggctgzcat cctgtacgcg cagcccagcg agcgcagcat catgcagagn 720 

cagagtctga tgctggacct ggcctacggg gaccgcgrct gggtgcggct cctcaagcgc 780 

cagcgcgaga acgccatcta ca'gcaacgac ttcgacacct acatcacctt cagcggccae 840 

ctcaccaagg ccgaggacga ctgagggcct ccgggccacc ctcccggctg gagagctcag 900 

ctgatacggc atcctgcgag aagacctgcc ctcctcactg ggatcccctt cctgcctcct 960 

cccagggctc tgccagggcc ' ttgctcaqtc ccttccacca aagtcatctg aacttccgtf 1020 
tcccagggcc tccagcr.gcc ctcagacact gatgtctgtc cccaggcgct ctctgcccct ' 1080" 

cacgcccctc tcaccggccc agtgccccga ctccccaggc ttcatcaagg tgctaaggcc 1140 

cgggcgggca gctccccgLc tcagagccct cccccggcct ggtgctgcct ttacaaacac' 1200 

ctgcaggaga agggccacgg aagccccagg ctttagagcc ctcagcaggt ctggggagct ' 1260 

agagcaaagg agggacctca ggcctcccgt tccttcttcc agggtggggc ggcctggtgt 1320 

tcccctagcc ttccaaaccc aggtggccng cccttctccc cagagggagg cggcctccgc 1380 

ccattggtgc tcatgcagac tctggggctg aggtgccccg gggggcga^c tccggcgctc 1440 

acagrcgagg gagccgLygc cccatggcca gangacggaa acagggtctg. accaagtgcc 1500 
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aggaagacct 
cctgccrgtcc 



gtgctataaa ccaccctgcc tgatcccgcc cctgcctgac. cccgccacgc 
agcatgatta aagaatgctg tctcctcttg gaaaaaaaaa aaaaaaaa 



<210> 83 ' 

<211> 2034 ' ■ ■ M ■ . 4 

<212> DNA • 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (14) 

<223> n equals a;t,g, or c 
<22 0> 

<221> SITE 
<222> (382) 

<223> n equals a.t.g, or. c 

<220> , 
<221> SITE 
< <222> •. (1999) 
( <223> n equals a,t,g, or c 

<220> 
<221> SITE 

<222> (2027) * 
<223> n equals a, t.g, or c , 

<40O> 83 . 
actcactaag ggancaaagc tggagctcca ccgcggtggc ggccgctcta gaactagtgg 
atcccccggg ctgcaggaat tcggcacgag atctaccggg agaJcggaytc tgaaagggcc 
ccggccagcg tccctgagac ■ gccaacggca gtcactgccc cccactccag ctcccgggat 
acgtactatc agccccgtgc cctggagaaa catgctgaca gcatcctggc actggcttca 
gtattctggt * ccatctctta ttactcctct cccttcgcct tcttctactt gtacaggaaa 
ggttacttga gtttgtccaa agtggtgccg ttctctcact atgcrgggac attgctgcta 
cttctggcag gtgtggcctg cntccgaggc attcgccgct ggaccaaccc ccagtaccgg 
cagttcatca ccatcttgga agcaacacat cggaaccagt cttcagaaaa caagaggcag 
cttgccaact acaactccga cttccggagc tygccagtcg acttccactg gga'agaaccc 
agcagccgga aggagtctcg agggggccct tcccgccggg gCgtggcccc gcticgccca 
gagcccctgc accgggggacagcagacacc ctcctcaacc gggttaagaa gctgcc-tgt 1 
cagatcacca gctacctggt ggcgcacacc ctagggcgcc ggatgctgta tccaggctct 
gtgtacctgc tgcagaaggc cctcatgcct gtgctgctgc agggccaggc ccgactggcg 
gaagag^gta atgggcgccg ggcaaayctg cLggcccgtg atggcaacga gattgacacc 
atgtttgtgg accggcgggg gacagccgag ccccagggac agaagctggt ■ gatctgctgt 
gaggggaatg ctgggtttta tgaggtgggc tgcgtctcca cgcccctgga agctggatar. 
tcagtcctgg gctggaatca tccaggcttt gctggaagca cgggggtgcc attcccgcag 
aatgaggcta atgccatgga tgtggtggtc cagtttgcca tccaccgcct rggcttccag 
ccccaggaca tcatcatcta cgcctggtcc atcggcggct tcaccgccac gtgggcagcc 
atgtcctacc cagatgttag tgccatgatc ctggatgcct cctt-gacga cctggtgccc 
ttggccttga aggtcatgcc agacagccgg aggggcctgg tgaccaggac cgtgaggcag 
catctcaatc taaacaacgc ggagcagctg tgcagatacc agggtcctgt actgctgatc 
cggagaacca aggatgagat catcaccacc acggttcctg aggacatcat gtccaaccga 
ggcaatgacc tcctgctgaa gctcctgcag catcggtatc cccgggtgat ggcagaggag 
ggtcctcgag tggtgaggca- gtggttggag gcctcctcac agctggagga agcctcaatt 
tatagccgat gggaggtgga agaggactgg cgtcr.ctcrg trcctccgctc ctaccacgca 
gaacacgggc ccgacttccc ctggagcgtg ggggaggaca tgagtgcaga cg^acggcgg 
cagctggctt tgtttczggc tcggoagcat c;gcacaact ttgaggccac tcaccgcacc 
ccactcccag cucagaacty ccagatgccc tggcacctcr. agggaccaac tgggactcat 



1618 



; ■ eo 

120 
180 
240 
, 3O0 ! 
360 
420 
480 
< 540 
600 
660 , 
t 720' 
. 780 
840 
900 
960 
1020 
1080 
114C 
1200 
12S0 
1320 
1380 
1440 
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tatggaagaa' cggggtgaga ggagacatga ggaaagaccc, ccttawttgt gattctctgt 1800 

. gttcacgttg ccgtttacag' tttgrggaaa gtgggggacc acccccct-c tcaccactgt . i860' 

ccctcttgca cgtttcccct cattcatgtg gctgtactda accctctcca acatacatcc 1920 

tgcattacat gaacggatta ttcctaataa ttaataaaaa ggtatttttt ctacaaaaaa ; 1980 

aaaaaaaaaa aaactccana atLacttcct aaaaccggcc cgcgggnccc atcc - 2034 



<210> 84 

,;<211> 2240 

*<212> DNA 

<21Z> Homo- sapiens 



<400> 84 ' , 

zcgaccacgc ? gtccgcttcc tcatgctgcc cgctcgccca ccgtccaggc tgctgagcct 60 

tttccttcgt gga^cuycLc ccacggcagc gcgccatggc ctccgggagc cgctcctgga ... 120 

! gaggagg-gc gctgccgccp cctccttcca gcactcatcg agtctgggac gcgagcttcc 180 

. ttatgacccc gtggacacgg agggctttgg agaaggtggtv gacatgcagg agcgttttct 240 

gttcccggag tacatcctgg atccggagcc gcaacccacc cgcgaaaagc ( agctgcagga 300* 

gctccagcaa cagcaggagg aggaggagcg ' acagaggcag cagcyycggg aggagcggcg . • 360 ~ 

acagcaaaac ctacgggcca ggccccggga. gcacccggtc gtggggcacc cggacccggc ■> 420 

attgccgccc agcggcgtga accgctcggg ctgcggggca gagctgcact gccaggacgc 4 80 

cggagtgccc ggctacctgc cccgagagaa gttcctccgc acggcggagg cagacggcgg 540 

gctggcacgg accgtgtgcc agcgctgctg gccgctgtcg caccaccggc gcgctctacg / '600. 

cctgcaggtg agccgcgagc agtacctgga -gctggtgagc gccgcgttgc ggcggcccgg 660 

cccctccctg gtgctctaca nggtggacct gctggacctg .cccgacgccc tgctgcccga . ' 720 

ctngcccgcg ctggcgggcc ccaagcagct gatcgzgctg ggaaacaaag cggaccccct ' 780 

gccccaggat gctcctggct accggcagag gctgcgggag cgactgtggg aggaccgtgc B40 

ccgcgccggg ctcctgctgg cccc tggcca ccaagggcca cagcgccccg tcaaggacga 900.. 

gccacaggac ggggagaatc cgaatccgcc gaactggtcc xgcacagtgg ccagggacgt 960 

gcggctgatc agcgccaaga ccggctatgg agtggaagag ttgatctctg cccttcagcg- '1020 

ctcctggcgc taccgtgggg , acgtctactt agtgggcgcc, accaacgccg gcaaatccac - -1080 

tctcttta_ac acgctcctgg agtccgatta ctgcactgcc aagggctccg acgccatcga 1140' 

cagagccacc. atctcccctt ggccaggtac tacattaaac cttctgaagt ttcctatttg 1200 

caacccaact ccttacagaa tgtttaaaag gcatcaaaga , crtaaaaaag . attcaactca 1260 

age tgaagaa gate t tag tg agcaagaaca aaaccagctt aatgtcctca aaaagcatgg 1320 

ttatgtcgta ggaagagttg gaaggacatt ctt'gtattca gaagaacaga aggataacat 1380. 

tccctttgag tttgatgctg attcacttgc ctttgacatg gaaaatgacc ctgttatggg 1440 

tacacacaaa tccaccaaac aagtagaatt gactgcacaa gacgtgaaag atgcccactg 1500 

gttttatgac acccctggaa tcacaaaaga aaactgtaLt ttaaatctta caacagaaaa 1560 

agaagtaaat attgtttcgc caacacagtc cattgttcca agaacctttg tgcctaaacc 1620 

aggaatggtt ctgtttttgg gtgctatagg ccgcatagat utcctgcagg. gaaatcagcc 1680 

agcttggtct acagtcgtgg cttccaacat cctccctgtg cataccacct ccttggacag 1740 

ggcagacgct ctgtatcaga agcatgcagg tcatacgtta ctccagattc caatgggtgg 1600 

aaaagaacga atggcaggat ttcctcctcL igctgctgaa gacatcacgt taaaagaagg I860 

actgggggca tctgaagcag tggccgacat caagttttcc tctgcaggtt gggtttcagt .1920 

aacacctaat tttaaggaca gactgcatct ccgaggctat acacctgaag gaacagtttt < 1980 

gaccgtccgg ccccctctct tgccatatat tgttaacacc aaaggacagc gcatcaagaa ■ 2040 ! 

aagtgtggr.c tataaaacca agaagccccc ttcccttatg tacaacgtga ggaagaagaa 2100 

aggaaagaca aatgtatgag accgaccttg ttcaccccag atattaactg caccgaacac 2160 

aacaaaatac actgaatttg tattaaacat ataacgcata aacaaagctc ccatcctt'ac 2220 

ccttaaaaaa aaaaaaaaaa *' 2240 



<210> 85 

<211p 1488 

<212> DNA 

<213> Homo sapiens 



<400> 65 
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cgccaagttc ccggagggag agggtagaaa ccggaggggg tggacctigtc actcacggga 60 

ctgagggtcc ttttctcccg cr.cccaggag gaacgagaac gaanatgac-' caagcccggg 120 

ttctggtggc tgcagtggrg gggttggtgg ctgtcctgct ctacgccccc acccacaaga 180 

tr.gaggaggg ccatztggct gtgtactaca ggggaggagc tttactaact 'agccccagtg' 240 

gaccaggcta tcatatcatg ttgcctctca ttact&cgtt cagatctgtg cagacaacac 300 

tacaaactga tgaagttaaa aatgtgcctt gtggaacaag cggtggggrc atgatctata 360 

ttgaccgaac agaagtggtt aatatgttgg ctccttatgc agtgtttgat atcgtgagga 420 

actatactgc agattatgac aagaccttaa tcctcaataa aatccaccat gagctgaacc 480 

agttctgcag' tgcccacaca,, cttcaggaag tttacattga attgtttgat caaatagatg , 540 

aaaacctgaa gcaagctctg cagaaagact taaacctcat ggccccaggt ctcactatac 600 

aggctgcgcg cgttacaaaa' cccaaaatcc cagaagccatf aagaagaaat tttgagttaa 660 

tggaggctga gaagacaaaa ctccttatag ctgcacagaa acaaaaggtt gtggaaaaag 720 

aagctgagac agagaggaaa aaggcagcta tagaagcaga gaagactgca caagtggcaa 780 

aaattcggct ccagcagaaa gtgatggaaa aagaaactga aaagcgcacc^ tctgaaatcg 840 

aagatgctgc attcctggcc cgagayaaag cgaaagcaga tgctgaatat tacgctgcac * . 900 

acaaatacgc cacctcaaac aagcacaagt tgaccccgga atatctggag ctcaaaaagt 960 

accaggccat tgcttctaac agtaagatct atrttggcag caacatccct aacatgttcg 1020 

tggac^cctc atgtgctttg aaatattcag atattaggac * tggaagagaa agctcactcc\ 1080 

cctctaagga ggctcttgaa ccctctggag agoacgtcat ccaaaacaaa gagacjcacag 1140 

gttgatgcaa gaggtggaaa tgttctccat atcaagatgt ggcccaaggg gttaagtggg ' 1200 

aacaatcatt atacggactc ttcagattta cagagaactt acactccatc tgttccacct' , 126C 

ctcctgcgat agtcctgggt gbtccaccga ttggaggata gagccagctg tctgacacac 1320 

aaatggtctt ttcagccaca gtcttatcaa gtatcctata tgtattcctt tctaaactgc 1380 

:actcatgaa tgaggaaagt ( ctgatgctaa gatactgcct gcactggaat gttaaacact 1440 

aaatatataa caagctgtgc tctcctaagc tgaaaaaaaa aaaaaaaa , ■ 1488 



<210> 86 
<211> 3174 
<212> DNA 

<213> Homo sapiens 



<400> 8 6 

gcggacgctg grcscaaaca ctaaggcctg agcggtgaca : atcgaggcgei gatgacggtc 60" 

aacagggaat gcctcgtggg agaaaaaaga caat~ttatc ctcagcgctg- attttgagat 120 

gacgggctcg ggaaacgggc gtcgcagcat gaagtcgccg cccctcgtcc tggccgccct 180 

ggtggcctgc atcatcgtct tgggcttcaa ciactggatt gcgagctccc ggagcgtgga 240- 

cctccagaca cggatcatgg agctggaagg cagggtccgc aggcgggctg cagagagagg - 300 

cgccgtggag ctgaagaaga acgagttcca gggagagctg gagaagcagc gggagcagcr. 360 

cgacaaaatc cagtccagcc acaacttcca gctggagagc gtcaacaagc tgzaccagga 420 

cgaaaaggcg gttttggtga ataacatcac cacaggtgag aggctcatcc gagtgctgca 480 

agaccagtta aagaccctgc 'agaggaatta cggcaggcc? cagcaggatg tcctccagtt ,540 

. tcagaagaac cagaccaacc tggagaggaa gttctcctac gacctgagcc . agtgcatcaa 600 

tcagatgaag gaggtgaagg aacagtgtga ggagcgaata gaagaggtca ccaaaaaggg 660 

gaatgaagct gtagcttcca gagacccgag tgaaaacaac gaccagagac agcagctcca 720 

agccctcagt gagcctcagc ccaggctgca ggcagcaggc ccgccacaca cagaggtgcc 780 

acaagggaag ggaaacgtgc ttggtaacag caagtcccag acaccagccc ccagttccga 840 

agtggtr.r.f.g gattcaaaga gacaagttga gaaagaggaa accaatgaga tccaggtggc ■ 900 

gaatgaggag cctcagaggg acaggctgcc gcaggagcca ggccgggagc aggtggtgga 960 

agacagacct gtaggcggaa gaggcctcgg gggagccgga gaactgggcc agaccccaca 1020 

ggtgcaggct cccctgtyag tgagccagga aaatccagag atggagggcc ctgagcgaga 1080 

ccagcttgtc atccccgacg gacaggagga ggagcaggaa gctgccgggg aagggagaaa . 1140 

ccagcagaaa ctgagaggag aagatgacta caacatggat gaaaacgaag cagaatctga 1200 

gacagacaag caagcagccc rggcagggaa tgacagaciac atagatgttt .ttaatgttga 1260 

agatcagaaa agagacacca taaatttact tgaccagcgt gaaaagc'gga atcatacact' ■ 1320 

ctgaattgaa ctggaatcac atatttcaca acagqgccga agagatgacc ataaaatgt.r. 1380 

catgagggac tgaataccga aaaccgtgaa atgtactaaa taaaatgtac atctgaagat 1440 

gattattgtg aaattttagt atgcacctcg cgtaggaaaa aacggaatgg -ctttcaaac 1500 

agctttcggg gggtactttg gaagtgtcta ataagtjty cc acaattcttg * gcagtaggta 1560 
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tttcgtgaga agttcaacac caaaactgga 'acatagtt=t ccttcaagtg ttggcgacag 1620- 

cggggcttcc tgattctgga atataacttt-gcgtaaatta acagccacct acagaagagt 1680 

, ccatctgctg tgaaggagag acagagaact ctggg-tccg ccgtcctgtc cacgrgctgt 1740 

accaagtgcr. ggtgccag'cc tgttacctct tctcaccgaa aagtctggct aatgctcttg 18O0 

tgtagtcacT: tctgattctg acaatcaatc aatcaatggc ctagagcact gaccgttaac 1860 

acaaacgtca ccagcaaagt agcaacagct ctaagtctaa atacaaagct gttctgtgtg 1920 

agaatttttt aaaaggctac. ttgtataata acccttgcca, "ttttaatgt acaaaacgct 1980 

■ attaagtggc ttagaatttg aacatttgtg gtctttactt actttgcttc gtgtgtgggc 2040 

aaagcaacat cttccctaaa tatatattac. caagaaaagc aagaagcoga ttaggttttt 2100 

gacaaaacaa acaggccaaa agggggctga cctggagcag agcacggtga gaggcaaggc 2160 

atgagaggyc aagtttgttg' tggacagatc tgtgcctact ttattactgg agtaaaagaa 2220 

aacaaagttc attgatgtcg aaggatatat acagtgttag .aaattaggac tgtttagaaa 2280 

aacaggaata caatggttgt t tttatcata gtgtacacat ttagcttgtg gtaaatgact 2340 

cacaaaaccg attttaaaat caagttaatg tgaattttga aaattactac Ctaatcctaa 2400 

ttcacaataa caatggcatt aaygctcgac ttgagttggc - tcttagtatt atttatggta 2460 

'aataggccct taccacttgc aaataactgg ccacatcatt aatgactgac ttcccagtaa 2520 

ggctctctaa ggggtaagta <jgaggatcca caggatttga gatgctaagg ccccagagat 2560' 

cgtttgatcc ' aaccctctta ttttcagagg ggaaaatggg gcctagaagt tacagagcat 2S40 
ctagctggtg cgctggcacc cctggcetca cacagactcc cgagtagcty ggactacagg - 2700 

. cacacagtca ctgaogcagg ccccgtctgc aactcacgtt gccacctcca acrtaaacat * 2760 1 

ccttcacacg ' -gatgtcctt agtcactaag gttaaacttt cccacccaga aaaggcaact 2820 

tagataaaat cttagagtac ' tttcatactc ttctaagtcc tcttccagcc tcactctgag 2880 

tcctccttgg ggttgatagg aattttc-ct tgcttcctca ataaagtctc tattcatctc 2940 

atgtctaatt tgtacgcata gaattgctga gaaataaaat gttctgttca acttaaaaaa 3000 

"cttgtcacag ctctgt tttt ' ttgcttttcc ccttcatcct ctattagatc cttaaagaat - 3060 

ccacccccaa actgcggac^. aaggatggca gtttgcctta aaccttctgt ggaatgagat 3120 
gggccgtcag gacgtcttca acccacagta ccccgaaagc tttgr.gcr.ra ccac - 3174 



<210> 87 

<211> 2780 - . ' . 

<212> DNA 

<213> Homo sapiens 

<220> . 

<221> SITE ' 

<222> (2760) . ' 

<222> n equals a.t.g, or c 



<400> 87 

ggcgccgcgg gcggggggtc ccggggccac gtggacgccg acggctctgg tctgcggcgt 60 ■ 

gtggacactg ccagcggctg cagccgactt ggaatgacct gggagacaaa tacaacagca 120 

tggaagakgc caaagfcctat gtggctaaag tggaccgcac ggcccactcc gacgtgtgct 180 

ccgcccaggg ggtgcgagga taccccacct taaagctttt uaagccagg'c caagaagctg 240 

tgaagtacca gggtcctcgg gacctccaga cactggaaaa ctggatgctg cagacactga 300 

acgaggagcc agtgacacca gagccggaag tggaaccgcc cagtgccccc gagctcaagc 360 

aagggctgta tgagctctca gcaagcaact ttgagctgca cgctgcacaa ggcgaccact 420 

ntatcaagtt ct-.tcgctccg tggtgtggtc actgcaaagc cctggctcca acctgggagc 480 

agctggctct gggccttgaa cattccgaaa ctgtcaagac tggcaaggcc gatcgtacac 540 

agcactatga , accctgcccc ggaaaccagg ttcgtggcta tcccactctt ccctggttcc 600 

gagacgggaa aaaggtggat cagtacaagg gaaagcggga tttggagtca cegagggagt 660 1 
acgtcgagtc gcagctgcag cgcacagaga ctggagcgac ggagaccgtc acgccctcag * 720 

aggccccggt gctggcagct gagcccgagg ctgacaaggg caccgtgtcg gcactcactg 780' 

aaaacaactt cgatgacacc attgcagaag gaaraacctt catcaagtct tatgctccat 840 

ggtgtggtca ctgcaagact ctggctccta cttgggagga actctctaaa aaggaattcc 900 

ctggtctggc gggggtcaag atcgccgaag tagactgcac tgctgaangg natratctgca 960 

gcaagtactc ggtacgaggc taccccacgt tattgctttt ccgaggaggg aa'gaaagtca 1020 

gtgagcacag tggaggcaga gaccttgacc cgttacaccg ctttgccctg agccaagcga 1080 

aagacgaact ttaggaacac agttggaggc cacctctccc gcccagctcc cgcaccctgc 1140 
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gtttaggagt tcagtcccac agaggccact gggttcccag tggtggctgt tcagaaagca 1200 

gaacatacta agcgtgaggt atcttctttg tg-gtgtgtt ttccaagcca acacactcta 1260 

cagattcttt attaagttaa gtttctctaa gtaaatgtgt aactcatggt caccgtgtaa 1320 

acattttcag tggcgatata tirccctttga ccccctcrtg atgaaattta catgcttrcc 1380 

tttgagacta aaotagcgct gagggaaatg aaattgcxgg actatttgtg gctcccgagt 1440 

tgagtgattt tggtgaaaga aagcacatcc aaagcatagt ttacctgccc acgagttctg 1500 

gaaaggtggc cttgtggcag tattgacgtt cctcrgatct taaggtcaca gttgactcaa 1560 

zactgtgttg gtrrcgtagca tggagcagat tgaaatgcaa aaacccacac ctctggaaga 1620 

:accttcacg gccgccgctg gagcttctgt tgctgtgaat acttctccca gtgcgagagg 1680 

ttagccgtga tgaaagcagc gttacttctg accgtgcctg agtaagagaa tgctgatgcc 1740 

ataactttat gcgtcgatac ttgtcaaatc agttactgtt caggggatcc ttctgtttct 1800 

cacggggtga aacatgtctt taQttcctca tgttaacacg aagccagagc ccacatgaac 1860 

tgttggatgt cttccttaga aagggtaggc atggaaaatt ccacgaggct cattctcagt 1920 
atctcattaa ctcattgaaa gattccagtt gtatttgtca cctggggtga • caagaccaga . 1980 

Raggett tec caggectggg- tatccaggga ggctctgcag ccccgctgaa gggccctaac 2040 

tagagttcta gagtttctga .ttctgtttct cagtagtcct tttagaggct tgetataett 2100 

ggtctgette aaggaggtcg accttctaat gtatgaagaa tgggatgeat ttgatctcaa " 2160 

gaccaaagac agatgtcagt gggctgctcc ggccctggtg tgcaeggctg tggcagctgt 2220 

tgatgccagt gtcctctaae tcatgctgtc crtgtgatta aacacctcta Lctcccttgg 22*80 

gaataagcac atacaggctt aagctctaag ataggtgttt gtccttttac catcgagcta 2340 

cttcccataa taaccacttt gcatccaaca ctcttcaccc acctcccata cgcaagggga 2400 

tgtggatacc tqgcccaaag taactggtgg taggaatctt agaaacaaga ccacttatac 2460 

tgtctgtctg aggcagaaga. taacagcagc atctcgacca gcctctgccfc taaaggaaat • 2520 

cttta-taat cacgtatggt ccacagataa ttcttttttt aaaaaaaccc aacctcctag 2580 

agaagcacaa ctgtcaagag tcttgtacac acaacttcag erttgeatea cgagtcttgt 2640 

attccaagaa aatcaaagtg gtacaatttg tttgtttaca ctatgatact ttctaaataa 2700 

actctttttt tttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaggggg gggggggggn 2760 

ggggcccccc cccccccaaa 2780 



<210> 88 
<211> 1061 
<212>DMA 

<213> Homo sapiens- 



<400> 88 

aatteggcac gagagaagga aatacatcaa aatgcccaca ttggttatct gtagaggata 60 

gaatgaaaga tggctttatt ctcttgtttg ctcttattaa agcaatcaga tggtgcttct - 120 

cctgtactca gagccctggc 'tgrttcctgc ctggcctctc 'ctgcgggctg ctgtggaacc 180 

agaaaagect taaacggaaa tgtgggagag aaggttggat tcactttcat gtctttccag 240 

ggttgtgacc cctcaagtcc tggttgcctt .tgctgttctc tattaccttc aaacagccag 300 

ctegtcttta tttctttttt agttttgtcg gggttggctt gatagatgtt agtccatcat 360 

agecagaegt gtctagcett gtcttttgaa tgcaagattt aggatgtggg tacttagctg 420 

ttageggaca tcagagtcoc tagtcoggat gaaayagCLe ttggctttaa ctcccagaaa 480 

ttctggtaac gtcatgtata gtgacggccg catgtctaac aggtggccag gtaagtcttt 540 

tggggtggtc tgtgaatcac agtttgggag acattgactt ttagggagtt tgttctgaat . 600 

tcactagata atagagatat aatacagagc tttgaaagct ggtgtctcga tgacagagee 660 

gtggcaatgg ggagggttga ggaggtggct gttgggcctg -ctcccggtg agagttgaoo • 720 

gggectgaac tcaagcagag gcctcagaac cgaaaggtgg tggaaggatg cagcaagagg 780 

cgccacacag gagcactctg cgccccggca gggtctcaat acacgtggga gtggtgagag 840 

ggagaacttt aagtccaggt tttgtgcctc agtgacttag tgtggccata tcattagaaa 900 

. tgtgttgagg ccgggcacag tggctcatgt ctgtaatccc agcactttga gaggctgagg 960 

caggaggatg gettgaggee aggagttraa aaccagcctg gacaaca~ag egagagectg 1020 

tctctacaaa aaaaaaaadd aaaaactcga gggggggece g • . 1061 



"<210> 89 
<211> 1342 
<212> UNA ' 



WO 00/06698 



59 



PCT/US99/i7l30 



<213> Homo sapiens' 



<4Q0> 89 ' . ■ , # . , . ' 5 . ' 

ccacgcgtcc gggegcgcgg gtgaaaggcg cattgatgca gcctgcggcg gcctcggagc ' 60 

gcggcggagc agacgctgac cacgttcctc tcctcggtct cctccgcctc cagctccgcg 120 

' ctgcccggca gccgggagcc atgcgacccc agggccccgc cgcctccccg cagcggctcc 180' 

gcggccccct gctgctcctg ctgctgcagc- tgcccgcgcc gtcgagcgcc tccgagatcc . 240 

ccaaggggaa gcaaaaggcg cagctccggc agagggaggt ggtggacctg tataatggaa i 300 
tgtgcttaca agggccagca r, ggagtgcctg gtcgagacgg gagccccggg gccaatggca * ' 360 
Ctccgggtac acctgggatc* ccaggtcggg atggattcaa aggagaaaag. ggggaatgtc , 420 

tgagggoaag. cttLgaggag .tccrggacac ccaactacaa gcagtgttca tggagttcat 480 

tgaattatgg catagatctt gggaaaattg cggagtgtac aittacaaag atgcgttcaa : ' 540 

' atagtgctct aagagzttzg ttcagtggct ■ cacttcggct aaaargcaga aatgcatgct 600 

gtgagcgttg gtatttcaca ttcaatggag ctgaatgttc aggacctctt cccattgaag 660 

"ctataattta tttggaccaa ggaagccctg aaatyadLtc'aacaattaat attcatcgca 720 

cttcttctg't ggaaggactt t gtgaaggaa , t tggtgc tgg attagcggat gttgctatct 780 

gggttggcac ttgttcagat tacccaaaag gagatgcttc tactggatgg aattcagttt 840 

c tcgca teat tat tgaagaa/ctaccaaaat aaatgettta attttcattt gctacctctt 900 

t-tttr.attat gccttggaat *ggttcactta aatgacattt raaataagtt tatgtataca -960^ 

tctgaatgaa aagcaaagcz aaatatgttt. acagaccaaa gtgtgatttc acactgtttt 1020 

taaatctagc attattcatt ttgettcaac ■ caaaagtggt ttcaatattn ctt'ttagttg 1080 

gttagaatac tttcttcata gtcacattct ctcaacctat aatttggaat attgrtgtgg 1140 

tcttttgttt tttctcttag tatagcattt ttaaaaaaat ataaaagcta ccaacctttg "1200 

, tacaatttgt aaatgttaag aatt:ttttt atatctc.tta, aataaaaatt, atttccaaaa t 1260 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ^aaaaaaaaaa , 1320 

aaaaaaaaaa aaaaaaaaaa aa \ : 1342 



<210>.90 

<211> 770 ', . 

<212> DNA > 
*<213> Homo sapiens 

<22C> 

<221> SITE • • • '• ' 

S222> (690), . 

<223> n equals a, t,g, or c 

<220> , ( • . ' . 

<221> SITE , 

<222> (762), ' 

<223> n equals* a> t, g, or c . 



<400> 90 , v ■ ! . ' ' ; 

ggtagggtcg ggkcgaccyr cgcgtccggc tcttccccag gcggcggcca tggegggaca ' * 60 
ggaggatccg gtgcagcggg agattcacca ggactgggct aacegggagt acattgagat * 120 " 

aatcaccagc agcatcaaga aaategcaga ctttctcaac" tegttcgata tgtcttgtcg 180 

ttcaagactt gcaacactaa acgagaaatt gacagccctt . gaacggagaa tagagtacat 240 

tgaagctegg gtgacaaaag gtgagacact cacctagaac agtgccgtgc tgctgctggg 300 

aagttgcttt acacaacaca ggccacatgg gaaaggcccc agcagccttc ; age tec t tee ' 360 

tttctcctta aagagcaaca gggcttattc ttgtttttct -tttttcaaaa gtgtggcctt 420 

tgggctctgc catctggggt gtggtgtggt atgtgggaagaagt-.tcagag gaaccgctgg : 480 •' 

aaacgacgtt aggcatttta ccttttcagt aacattttat acatctactt gecaatgtat > 540 

ttgagacatt cacagccaaa -agectgggae tctttgtgaa ggtcctcctc amctctatct 600, 

ttctttctct ctctctcaaa ctttccctaa agttctcatt gcctttgcac tgcttctgtg 660 
aacagtcttt gtctcctccc cacctt-ggn ggaaagtgcg gggcagtcct ggtcaagaca . 720 

ctcatgccct ggcaatgtgg ectgeagaga atgtrgttgt- anccaccagt 770 
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<210> 91 
<211> 1570 

<212> DNA . ; 

<213;> Homo sapiens 



<400> 91 

yatggtttta ctgaagctgc caaaagttta cagaaaagga agtgcaggaa catttcacaa 60 

atctacaatc tgtgagtatc acatcctgta tagctgtmaa cactggaata aggaagggct 120 

gatgact:tc agaagatgaa ggtaagtaga aaccgttgat gggactgaga aaccagagt't 180 

aaaacctctt tggagcttct gaggactcag ctggaaccaa cgggcacagt tggcaacacc 240 

atcatgacat cacaacctg: tcccaaLgay accaccacag tgcLcccatc aaacgtcatc 300 

'aacttctccc aagcagagaa acccgaaccc h accaaccagg ggcaggatag cctgaagaaa '360 

catctacacg cagaaatcaa agttattggg actatccaga tcttgtgtgg catgatggta 420 

ttgagcttgg ggatcattnt ggcatctgcc t tccttctctc caaattttac ccaagtgact 480 

'tctacactgc tgaactctgc ttacccattc ' ataggaccct ttttttttat catct=Lygc 540^ 

tctctatcaa tcgccacaga gaaaaggttr accaagcttt tggtgcatag cagcctggtt 600 

ggaagcattc cgagtgctct gtctgccctg gcgggtttca ttatcctgrc tgtcaaacag 660 

' gccaccttaa atcctgcctc actgcagtgt gagttggaca aaaataatat accaacaaga 720 

agtcatgttt cttarf.tf.ta tcatgattca , ctttatacca cggactgcta Cacagccaaa '. '780 

gccagtctgg ctggawctct ctctctgatg ctgatttgca ctctgctgea - atictgccta 840 

gctgtgctca cLyccgcgct gcggtggaaa caggcttact ctgacttccc tgggagtgta .; 900 

cttttcccgc ctcacagcta cattcgtaat tctggcatgt cctcaaaaat gacccatgac. 960 

tgtggatatg aagaactatz _ gacttctcaa gaaaaaaggg agaaatacta accagaaagt 1020 

tgattcttat gataatangg aaaagttaac cattacagaa aagcaaagct tgagtttcct 1080 

aaatgtaagc, ttttaaagta atgaacatta aaaaaaacca ttatctcact gtcattcaag 1140 

atatgtgttc attggggaLc tcttgacttg cctgacattg acttcagcaa aagcacgggg 1200 

ctgtaaatta ccatttacta gatcagccaa atagtccgaa tttccagaaa acaaggcaga 1260 

• atgaccattCccagaaacat ttcccagaaa atgtttccca gaaaactaga cagmatgatc 1320 

attcaatgga tcacagtgaa gcaaaggaca caacttrtta ctgtacccct taattgtcaa 1380 

caggagttaa.ctgatttgtt gtggtgctca. gactttttta tacaggtgct agtgttttat 1440 

cctatgtatt ttaactcatt agtgcataaa ggcaagcccc atataatgaa gtctcagggt 1500 

acatgaaagt agccggct^c aaaataaaat ttttgagtgc aaaaaaaaaa aaaaataaaa 1560 

aaaaaaaaaa ' 1570 



<210> 92 

<211> 2950 

<212> DNA 

<213> Homo sapiens 



<40O> 92 

cccggctccc gcccgctccc. agccgggccc cccagcggtc ggcgggacgg. ctcccggctg 60 
cagtctgccc gcccgccccg cgcggggg'cc gagtcgcgaa gcgcgcctgc gacccggcgt , 120 

ccgggcgcgc tggagaggac gcgaggagcc atgaggcgcc agctgcgaag gcggcggcgc 180 

tgctgctcgg gctgctcttg gagtgcacag aagccaaaaa gcatcgctgg tatttcgaag 240 

gactctatcc adcctatcac atatgccgct cctacgagga ctgctgtggc tccaggtgct 300 

gtgcgcgggc cctccccara cagaggctgt ggtacttctg gttccttctg atgacgggcg 360 

tgcfctttctg ctqcggagcc ggcttcttca tccggaggcg rraf.gtacccc ccgccgctga 420 

tcgaggagcc agccttcaac' gtgtcctaca ccaggcagcc cccaaatccc ggcccaggag 480 

cccagcagcc ggggccgccc tattacacyg acccaggagg accggggarg aaccctgtcg 540 

ggaattccac ggcaatggct ttccaggtcc cacccaactc accccagggg. agtgtggcct 600 

gcccgccccc tccagcctac tgcaacacgc ctccgccccc gtacgaacag gtagtgaagg 660 

ccaagtagtg gggtgcccac gcgcaagagg agagacagga gagggccttt ccctggcctt 720 

tctgtcttcg ttgatgttca ct:tccaggaa cggtctcgtg ggctgctaag ggcagttcct 780 

ctgataiccc cacagcaagc r acagctccct ttcaggcttt ccatggagta caatatatga 840 

actcacactt tgtctcctct gttgcttctg ■cttctgacgc acctgtgctc tcacatggta 900 

gcgtggtgac agtccccgag ggctgacgtc cttacggtgg cgtgaccaga tctacaggag 9S0 

agagactgag aggaagaagg cagcgctgga ggtgcaggtg gcatgtagag gggccaggcc 1020 

jagcatccca ggcaagcaue cttctgcccg ggtattaata ggaagcccca -gccgggcgg 1080 
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ctcagccgat gddgcagcag ccgactgagc tgagcccagc aggtcatctg ctccagcctg 1140 

tcc^'ctcgtc agcctccccc ntccagaagc tgttggagag acattcagga gagagcaagc 1200 

cccctgtcat gtttctgtct ccgttcatat cctaaagata gacttctcct gcaccgccag 1260 

gaaagggtag cacgtgcagc j tcccaccgca f gatggggcct agaatcaggc ttgcttggag - 1320 

gcctgacagt gatctgacac ccactaagca aatttattta aattcatggg aaatcacLtc 1380 

ctgccccaaa ctgagacatt gcattttgtg agctcctggt ctgacttgga gaaaggactg 1440 

ttacccattt ttttggtgtg tttatggaag tgcatgtaga- gcgtcctgcc ctttgaaatc 1500 

agactgggtg tgt'gtcttcc ctggacatca "ctgcctctcc agggcattct caggcccggg 1560 

ggtctccttc cctcaggcag' ctccagtggt gggttctgaa gggtgctttc aaaacggggc 1620 

•acatctggct gggaagtcac atggactctt ccagggagag agaccagctg aggcgtctct 1680 

ctctgagytc gtgttgggcc caagcgggtg tgtgctgggc tccaaggagg aggagcttgc 1740 

tgggaaaaga caggagaagt actgactcaa ctgcactgac catgttgtca taattagaat 1800 

aaagaagaag tggccggaaa tgcacactcc tggaraggaa tcacagctca ccccaggatc i860 

tcacaggtag tctcctgagt agttgacggc tagcggggog ctagttccgc cgcata'gtca 1920 

tagtgctgat gtgtgaacgc tgacctgLcc .tgtgtgctaa gagccatgca gcctagctga 1980 

ggcgcctaga ttactagatg tgctgtacca cggggaatga ggtgggggtg ct.atttttt 2040 

aatgaaccaa tcagagcctc fctgagaaatt gttactcatt gaactggagc atcaagacat 2100 

ctcatggaag tggatacgga gtgatttggt gtcbatgctt ttcactctga ggacatttaa '2160 

t-cggagaacc tcctggggaa tctcgtggga gacacttggg aacaaaacag acaccctggg *2220 

aatgcagttg caagcacaga tgctgccacc ag-gtctctg accaccctgg tgtgactgct '2280 

gactgccagc gtggtacctc ccatgctgca ggcctccatc taaatgagac aacaaagcac 2340,. 

aatgttcacL gtttacaacc aagacaactg cgtgggtcca aacactcctc ttcctccagg . 2400 

tcatttgttt tgcattt.tta atgtctttat tttttgtaat gaaaaagcac- actaagctgc 2460 

ccctggaatc gggtgcagct gaataggcac ccaaaagtcc gcgaccaaat ttcgtttgtc 2520 

tttttgatag caaattatgt Caagagacag tgatggctag ggcccaacaa ttttgtattc 2580 

ccatgtttgt gtgagacaga gtttgttttc. ccttgaactt ggttagaatt gtgctactgt 2640 

gaacgctgat cctgcatatg gaagtcccrc^ tccggtgaca tttcctggcc attcttgttt 2700 ; 

ccattgtgtg gatggtgggt rgtgcccact* tcctggagtg agacagctcc tggtgtgtag 2760 A 

aattcccgga gcgtccgtgg ttcagagtaa acttgaagca gatctgtgca tgcttctcct 2820' 

ctgcaacaat tggctcgttt ctctt ttttg* ttctc ttttg acaggatcct gtttcctatg 2880 

tgtgcaaaat aaaaataaac ttgggcaaaa aaaaaaaaaa* aaaaaaaaaa aaaaaaaaaa 2940 

aaaaaaaaag 2950 1 



<210> 93 
<211> .1722 
<212> DNA 

<213> Homo sapiens 



<400> 93 

ggcacgagcc agagcaggct gctaggcctg gggccaccac tgcccctggg tgctacaccc 60 

agtgtgctgg gtcactggga acttcctgaa gtgccgtcac ctgaactggg ' cccccaagga 120 

tggggtgcgg gcagcaccgc aggaagagga gcagcccccg tgaagattga gagctgccag 180 

aggctctgtg attggctgcg gcacgatgac ccgcgcacgg attggctgct tcgggccggg 240 

gggccgggcc cgggggacag aatccgcccc cgaaccttca aagagggtac cccccggcag 300 

gagctggcag accyaggagg Cgcgacagac ccgcggggca aacggactgg ggccaagagc 360 

cgggagcgcg ggcgcaaagg caccagggcc cgcccagggc gccgcgcagc acggccttgg 420 

gggttcngcg ggccntcggg tgcgcgtctc gcctctagcc atggggtccg cagcgttgga 480 

gatcctgggc ctggtgctgt gcctggtggg ctgggggggt ctgatcctgg cgtgcgggct . 540 

gcccatgtgg caggcgaccg cccrcccgga ccacaacatc gtgacggcgc agaccacctg 600 

gaaggggctg tggatgtcgt gcgtggtgca gagcacsggg cacatgcagt gcaaagtgta 660 

cgactcggtg ctggctctga gcaccgaggr. gcaggcggcg cgggcgctca ccgtgagcgc 720 

cgtgctgctg gcgttcgttg .cgctettcgt gaccctggcg ggcgcgcagt gcaccacctg 780 

cgtggccccg ggcccggcca argcgcgtgt ggccctcacg ggaggcgcgc tccacctgtt 840 
ttgcgggczg ctggcgctcg -tgccactctg ctggttcgcc aacattgtcg tccgcgagtt ■ 900 

Ctacgacccg -ctgtgcccg . tgtcgcagaa gtacgagctg ggcgcagcgc . tgtacatcgg 960 
ctgggcggcc accgcgctgc . tcatggtagg cggctgcctc ttgtgctgcg gcgcctgggt ' 1020 

ctgcaccggc cgtcccgacc tcagcttccc cgcgaag-ac tcagcgccgc ggcggcccac 1080 

ggccaccggc gactacgaca agaagaacta cgucLgaggg cgccgggcac. ggccgggccc 1140 ' 



WO 00/06698 



62 



PCT/US99/I7130 



ctcctgzcag ccacgcc^gc gaggcgttgg ataagcctgc qgagccccgc atggaccgcg 1200 

gcttccgccg ggtagcgcgg cgcgcaggct cctcggaacg tccggctctg cgccccgacg 1260 

cggctcctgg acccgctcct gcctgcgccc gcagctgacc ttctcctgcc actagcccgg 1320 

ccctgcccfcr. aacagacgga azgaagt t tc cLLttctgng. cgcggcgctg tttccatagg 1380 , 

cagagcgggf gtcagactga ggatrtcgct tcccctccaa gacgctgggg gtcztggctg, 1440 

ctgcct-acc tcccagaggc tcctgccgac ttccgagggq cggatgcaga Qcccagggcc 1500 

cccaccggaa gatgtgtaca gctggtcctt accccatcgg caggg=ccga gcccagggac 1560 

cagtgacttg gcctggacct cccggtccca ctceagcatc tccccaggca aggcttgtgg' 1620 

gcaccggagc ttgagagagg gcgggagcgg gaaggctaag aatctgctta gtaaatggtt 1680 

ugaacnccca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1722 



<210> 94, . 
'<211> 635 
<212> DNA 
<213> Homo sapiens 



<400> 94 , " * 

gcctaoagag agctccccca yyaccagccc tggccaaggg at-gctgcag ccctcatcca 60 

cctrccaagc actggaaaca aacattggag accaagtgag gcgtcactca acacccgtag 12ft 

taatcaggga aatgacaagt . tacatactga tat'ccttcgt tttgctgatt ggagttgggt 180 

gcattgaaaa agatcagtcg tgeccagtgt . ttgggggaag gaagcgtctt cacctgttgt 240. 

ttgtggeagg acagttgagg caggtgagga fcgcngagagg tgagctcagc tgtgcctgtt 300 \ 

accgcccaca tgtgcaagcc cttcagctcg gtggrtgtac ttgtttttga gatgcagttt 360 

cactcttgcc acccaggctg gagtgcatgg catgazcttg gctcgctgca acatccgcct .420 

cccgggttca agcgattctc ctgtccctac naaaaataca . aaaat tagct gggtgtggtg 480 

gtgcg^gcct ttaatcccag rtactcagaa ggctgaggtg caagaaccgc ttgaacctgg 540 

gaggtggagg ttgccgtggg ccgagatcac gccaccgcac tccagcctag -gcaacagagc 600 

tagactgtct caaaaaaaaa aaaatgaccc tcgag 635 * 



<210> 95 

<211> 3798 - 

<212> DNA 

<213> Hoir.o sapiens 



<400> 95 ,. . , 

ccacgcgtcc ggggcttcaC acaggaaatc caccgctgtg tcaagttcca gagaaaagct 60 

tctgttcgtc caagttacLa accaggccaa accacataga cgcgaaggaa ggggctagaa 120 

ggaagggagt gccccactgt tgatggggta agaggatcct gtactgagaa gttgaccaga ' 180 ! 

gagggtccca ccatgcgcac agttccttc't gtaccagtgt ggaggaaaag tactgagtga 240 
agggcagaaa aagagaaaac agaaatgctc tgccctrgga gaactgctaa cctagggcta ' 300 . 

ctgttgattt tgactatctt cttagtggcc gaagcggogg gtgctgctca accaaacaac 360 

tcattaacgc tgcaaactag caaggagaat catgccttag cttcaagcag tttatgtatg 420 

gatgaaaaac agactacaca gaactactcg aaagtactcg cagaagctaa cacttcatgg 480 

cctgtaaaga tggctacaaa tgctgtgctt cgttgccctc ctatcgcacn aagaaactr.g ' 540 

atcataaraa catgggaaat aatcctgaga ggccagcctt cctgcacaaa agcccacaag; 600 

aaagaaacaa aLgagaccaa ggaeaccaac tgtactgatg , agayaataac clgygLcLcc; .660 

agacctgatc agaattcgga cctccagatc cgtaccgtgg ccaccactca tgacgggtat 720 
tacagatgca taa-ggtaac acctgatggg aatttccatc gtggatatca cctccaagtg - 780 E 

ttagttacac ctgapgtgac cctgt^tcaa aacaggaata gaactgcagt atgcaaggca 840 
gttgcaggga agccagcLgc gcatatctcc tggarcccag agggcgattg tgccactaag ,' SOO 

caagaatacL" ggagcaarcg cacagtgact gttaagagta catgccactg ggaggtccac 960 

aotgtgtctia ccgtgaactg ccacgtctcc catttgactg . gcaacaagag tctgtacata- 1020 

gagctacctc c-gttccagg tgccaaaaaa tcagcaaaat tatatatccc atatatcatc 1080 

cttactatca tcatctcgac ratcgtggga tccacttggt rgt-gaaagt caatggctgc 114C 

agaaaacata aattgaacaa aacagaatct accccagczg ttgaggagga cgaaargcag 1200 

ccctacgcca gctacacaga gaagaacaat cccctcca-g acactacaaa caagccgeag 1260 
gcacctigcigg cattacaaag tgaagttgac acagacctcc atactttat'a agttcctgga . 1320 
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ctctagtacc aagaaacaac aacaaacgac atacattata attactgtct gattttctta 1380 

cagttccaga atgaagactt atat tgaaa t . taggttttcc aaggttctta gaagacattt ,1440 
taatggatrc tcattcatac ccttgtataa ttggaatttt tgattcttag ctgc taccag ' : * 1500 

ctagttctcc gaagaactga tgttattaca aagaaaatac atgcccatga ccaaatattc 1560 

aaattgtgca ggacagtaaa taatgaaaac caaatttcct caagaaataa ctgaagaagg 1620 

agcaagtgtg aacagcttct tgcgtatcct ttcagaatat tttaatgtac atatgacatg 1680 

tgtatatgcc tatggtatat gtgtcaartt atgtgtcccc ttacatatac atgcacatat 1740 

ctttgtcaag gcaccagtgg gaacaataca ctgcattact gttctataca tatgaaaacc 1800 

caataatata agtcttagag atcattttat atcatgacaa gtagagctac ctcartcttt. I860 

ttaatggtta tataaaattc cattgtacag ttacaLcatt atttaattaa aaacaaccct 1920 

aacgatggat atttagattc ttttaagttt tgtttattcc ttttaagttt tgtttgtggt 1980 

ataaacaata ccacatagaa tgtttcttgt gcatatatct ctttgttttt gagtatatct 2 040 

gtaggataac tttcttgagt ggaattgtca ggtcaaaggg tttgtgcatt ttactattga 2100 

tatatatgtt aaattgtgtc aaaLatatat gtcaaattcc ctccaacatt gtttaaatgt 2160 

gcctttccct aaatttctat tttaataact gtactattcc tgcttctaca gttgccactt 2220 

■tctcttttta atcaaccaga ' ttaaatatga tgtgagatta taataagaat tatactattt 2280 

aataaaaatg gatttatatt tttggtcatg tttgtaagag agtgaatgca cgtgtgagaa 2340 

cattagctcc ttctgaoctc attatatctc cacacaggtg Ctgat.acttg atgcctaaca 2400 

g-tttgcaga Lgtgctacat tggaattgtg tatttttatg gtgtacattc ; tattgtgata 2460 

tatttattga acaattaatg tctattgacc atataagtgg cgaaaaatyc accatagagg 2520 . 

acatggggta tttatttaca aactatgagc tacataataa gcaagtggco atgggatggc 25B0 
a-.gaccctcc cctccatart tttgtggagc aaaatattgg caatgtttat gtaaatcatt - 2640 

gctaatatca cgaaactott tttaattaaa aacataagtc tatttgctcc atagcagaaa 2700 

aaacatgaga ag-ttrttca tcatgataga, aattgaaaca aaatatattc attcttcaat 2760 

cacaccatct gagattttca agacagctat tttgtcttat aagtatattt ttctccctct 2820 

agacatttca gttactatgg attttgtcct caaagggacc tttagtctat tttggatgta 2880 

aagctaatct aatgacactt ggcacatgat attttgatca agccattttg acttgaccaa 2940 

aaagcagtgt ccactaggtt tctgcataca aatattacca agcaatgttc acaatagaca 3000 

tcattacact gtccttgaaa tttattaatt cttcatccaa ccctggttga gctgaggctc 3060 

atagttaggc tcaagacta: ctgtttaaat attactgaaa aacaaagtaa gacagtacta' 3120 

tgcttacctc ttaacttgat aatgtcaaaa caggcatgtt aaatgacate atagaaaaga 3180 

dttcaagata atttatagaa gttaaatta't attgtacaga aaataattgt atgaaaatct 3240 

ctactatggg gccggaacat ggttgaacat tagaatgata taaaaaatta tatatattct 3300 

ccaaatccac gctagacccg zcaaattaga gaatctagag actagacctg gcgtgzcagc 3360 

aaggtcatcc aggaagcaga ggctgagacg gagttaggtg tyattactta catagtcgat 3420 

tacatttt'ac aaataacaht ctatatgtct catrtactgt gctttctccc caccccattt 3480 

tgtatctttt'.ccctcgcttt gctagatttg -tcaat'tctct ctctctttct gtctctctct 3540 

ctttcaatat ctctoataat ctgaaagtaa ttcatcataa ccaaatatct attggggtta 3600 

tgcttcactt acaaacttct gaaaacggct ttactgagat ataattgata tatttaagtg 3660 

tacagtt'tgt- taaattttgc acatatttaa aatgtggact ttgg'taaatg tcgacatagt 3720 

ttt'acatctg cgaaaccatc agcataatca agacaataaa cttgtccatc accccccaaa 3780 

aaaaaaaaaa aaaaaaaa 3798 



<210> 96 
<211> 2683 
<212> DNA 



<221> SITE ;. 
<222> (2640 J 

<223> n equals a,w,g, or c 
<220> 

<221> SITE 
<222> {2676} 
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<400> 96 
acaasgtmac 
gcatttgcaa 
tgttgcgggg 
gatrccgttt 
tgcattccct 
.tgtatgtatc 
aatcagttgc 
ttatcaacta 
ctgagaatat 
gacctatgaa 
tgtatttaaa 
cgcaagatct 
ttgtgcagcc. 
tcctaaaaac 
, tgataaacag 
gagtatacct 
gtatgacata 
tcataatgga 
gggaaggaaa 
tcaggtgtgg 
ggcataatga 
gagttgatga, 
aaatgatcaa 
attaatggtt 
ccactaaatc 
gtgtgtgtat 
Ctcccctacg 
caataaaaaa 
tgaatctggc 
ataatttgtg 
'tgatttttac 
tagttttctt 
ctttgtgtat 
tactttcttc 
agagaagaac 
tgtccacagg 
gaqtttgtct 
ctgttggtct 
gtcctgtact 
ttttttccta 
caaatagcag 
tagtaggtat 
acatctgtat 
cttaattact 
ccatgccata 



gcctgacagg 
taacagaaaa 

agggggtcgg 

gtgtgtgcat 
gtgtattttc 
tttctgtcat 
ttgcttcttt 
ccccaattgc 
gttctagaga 
taatacacct 
ccaagtctgt 
tcataccaaa 
gacagarg ta 
aaattgtttg 
ttgaaattca 
attttcttag 
ggaaagtcat 
acatgtaaac 
agtggtzcta 
aaaatatttt 
ctgctggttt 
. ggaccacctt 
ctttaatcca 
ctcatgactt 
gttgacttga 
atatgcattt 
catgtgtatt 
aaaaccaaaa 
atcacagtga 
tctaaatttc 
attgctttag 
cctgtttttt 
gatgctgaat 
agttgtagaa 
aggtaatagg 
agaaactaac 
ccatcagttg 
catggtcaat 
atgtctgtaa 
gcctacgctc 
ttctcagaat 
gcatatgttt 
gcacacaf.ct 
tcgtgagtgr- 
cttggttgga" 



tnuiccgga cc 
ggaactgcat 
gggatagtrt 
attttatcca 
ag"gacatggc 
cactyttccc 
caaagtgctt 
atgttctcca 
tattggaata 
tagtctagtt 
gaatacctgc 
gtaggaagta 
atgrttacatc 
gaaagtacaa 
agatgtgttt 
cgtatctgcc 
ttttntttag 
acactgaaaa- 
ctaatgctcc 
ttag ttgatt 
tccagactgg 
tgtgtataca 
ggtagaattc 
gtgtggcatc 
att _ tgggaa 
ataaacaagt 
ccacacacaa 
cagattgtca 
gcaaatgtcg 
tgt Uggggta 
gaaacatttt 
tatgtgtcat 
tacaaaccgt 
aaggtcaatt 
gagaaataaa 
cattcagttg 
ttgactttcc 
atcaatcaga 
ctactaagtt 
actccccaac 
ctagtcaagt 
gtttctgtac 
ttgataaaat 
tattggatac 
ctat tttggg 



cgggaattcc 
gtatgaagtt 
■gatttccatt 
gccttaagtt 
tcgtgggtg- 
tctcctcccg 
cgaaggtctt 
ccacatattc 
aagctgtctg 
cttLattcna 
tttttttggc 
aaaatagctt 
acctaaaaaa 
tgcaccacat 
gacccctagt 
ttgtttatct 
aattcatgga 
ctgcttttca 
aaaatcctca 
gagaatgcag 
attttcciac 
ctrgtagttt 
aagatggctg 
taaaaataat 
aaaaaaaagt 
gcgtttgagt; 
atggctgagt 
gttaaccagg 
tactaattta 
gaaattacta 
tactaattca 
cgr.tagtggt 
ttgaatgatc 
tccctcagtg 
catacggtigg 
tcttaatttt 
aaaatgttgc 
ctttcatgat 
taaagaaaag 
ccatcccaac 
tgccatcatc 
agtactgcgt 
agctattLga 
atcttaaaaa 
gcattnaaaa 



cgggtcgacc 
ttcaatcgtg 
ttccgaaaac 
ataaagctca 
gtgtgttcat 
agtgtgcatt 
gaactcatgt 
tcttatttgc 
ggtaaggagc 
aatctggatt 
cacagagtaa 
agaaagctct 
gaaagataca 
Ltttgtagaa, 
catttttact 
ttttcttcac 
tcagtctgat 
ggagagaaac 
tcagagaagg 
gtttaacaga 
cgcaactatt 
taaaccttgc 
tacttcagtt 
gtttttatag 
tggtgttgat 
aacaagtgag 
' tatagtcata 
aaacagttaa 
ggctaatttc 
aaattgtggg 
gctgtcttag 
ctcagaattc ' 
cagttgaaaa 
tcccacatta 
tttcagtggt 
agttcgttct 
attaagtaat. 
utctactaat 
cacatagtca 
attgacatgc 
ccccttgcct 
gtgtgtgtgc 
ctagcagggt 
aaaaaaatct 
teg 



cacggcgtcc 
ggectttett 
gacagacttg 
tctgtcccgc 
tgtgtgcgtc 
cagttaatat 
gtgagcatct 
tctgtacccc 
aggcttagec. 
gecagtattg 
caagttttca 
gtcaggtgtt 
eggtcagtta 
gtc-actatc 
Ctttggttct , 
cttttaacaa 
ct&ctcttat 
acgagttgga 
tatgatgttc 
agagataagg 
aatgttctca 
attggtaaca ' 
gtatgataaa 
catctctctg 
atgtatatgt 
tttcatagtc 
aaacaatttg 
tgttttttaa 
taa'tactacc 
gagttttttc . 
gtaaaatgaa 
tgatcagtaa 
cgtatccctc 
taccaaccta 
ettggtcatg 
accctgtgag 
agttgtcact 
tattegtaga 
cttcatctct 
tatctgtgga 
-ggccgttca 
atacatatat 
taaagtggct ' 
ggacccagaii 



60 
12 0 
'180 
240 
300 
360' 
420 
480 
540, 
600 
660 
720 
.780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
• 1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
; 2160 
2220 
2280 
2340 
2400 
2460 
2520 ' 
2580 
2640 
2683 



<210> 97 . , : t 

<211> 2181 
<212> DMA 

<213> Homo sapiens 

<220> • . ■ 

<221> SITE ' 
<222> (5) 

<223> n equals a, t,g, or c 



<400> 97 
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<210> 98 

<2U> 1957 

<212> DNA 

<213> Homo sapiens 

<400> 98 



til llllctl TaT ggC9 ' C "=«c^ 3 gtcgeccgcg ctccgggcgc 
g^cgggggc cftaacacaa nit CS9a9 ^ a ^Scgggtgg cgcccgggcg 

acaactcgag gctccaaacc cro^r C^ggoccgc catcctgcag ggactcacgg 
gag tg g t | C g S J^SSS £££ " ^-tcg Ctt g g3gaC a 

cacccccatt Lggcctgcta a ^ 9 tctctttcgg cactacrccc 

gcccgggggt gcccttccal >™ 9 Ctc - ca S«" c ?g ctttggaatc gcaggagccg 

ssr Er" ™- £££ 

catctcacc tlccccggct agccaaaaca gctggcagcja ctgccggtca' 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



'ESteKUK lllli ca9t * 
■ £K£ c'Efes 

tggctatctt g?gcaita=? ttgctcc^a aaa^f!"^ ^""J. 
attcgtgctt gacagcag-g ct-ccacaat ^11 cc »«ccca agaatgtggt 

cctcttcaca .attcLc!fc g acccccalcc III™ , ccaaggacgc 
' caaccogatc aaagtacgga aggacc!ctt T "cagtatca . tcggattttc 

t9 ggaaag tg tacattcalc 2S = c S^a 
gcagagggcc atcaggctcc ""reaggc acagacatca acggggtcct 

cgtgtcccrc at cg?2ttcc ga " ! a t ! 9tBSCCCaC "«tweatt B gacaccggag 
caagatcctc aacaacaccc oaalaa^ 9aagccca C g gtcggggaga cgcacaccct 
catcggcaac gacgrggact tcIocS CC9a99ccaa ?^g C at CC tcacc attgg 
cracacggcgc gtqcafoa™ *™ 9 9Bagaaac t g tcgctggaga actgcggcct 
aatcaggacc cc*£?£? ctaa^f^ a " CtCsca * ^atcgggt tcjcgatqa 
ggccaccaag ac^S STSffiS'-SSg 

sss™ =: 

aggaagcccc aggccCggag oS™! 9C59CCtM3 -re«=-»ffga aagatgtcac losi 
ga g =tacc=c accacaaagg agc^ctgag aaCCacatC 3 agcgtctctg ' info 

« SS» ^ 0 

sssa ss5-~ iF- «ss 

agtggatggt garccccact t?«??»h« c = a 9 ccaa 9 a attaaaatct ctaaaacatc 
caaca tt gl t ; ggc ^:^ ££££ ££££ 

2^s.5ses: ----- 

gatcacaccg agcagagtca tcttaa^S " tCaaca * g "agagagat cttatctcga 
gagtgtggtg g^ggggagct aaoak^n" tgg9gacaga ^ggtgctcc cctgcaacca 
cgtcaccatc ca£2[££ Wtgtccgtg tccgccaacg ccaatgtcac 

. .ccc.tccag ,-ga g cIccacc tgg g^tcL — ^aca aaaagccggc 

ctgccacgga ctgctgggtc J??«f"! cattgecaac agcgagggcc .tttccagcaa 
agggcccagc cagaacctca c aZlltl^ 39actcaca ^ "gaccctgc 

cgtcccaaea gtgaaaagcc c Sctccttcag gtgggagagg ggcctgaggc 

cggggaagag Law L ovta*^ agtggtCt ^ ^gcaaagga agatttacaa 

^agcacagg^ tc ;cctggca t y ^ ^ Cagg3aCat * -gccaaact gattgacggg 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2181 



60 
120 
180 
240 
3C0 
360 
.420 
480 
540 
600 
660 
720 
780 
840 



WO 00/06698 



PCT/US99/17130 



66 



cacaccagca 
gcctcagccc 
cagcaaaagc 
gcggtcgggg 
tcccctggac 
ctcagagatg 
ggtcaggggg 
cagagggcag 
ccaccLgcag 
acggcccccg 
tctgtggcac 
gcca tcggca 
tcgtccatgt 
accccacctc 
acgtgcacac 
gcaggcctgc 
ctct tccagc 
tgaccacacc 
aaagccntcg 



cgtcccacct 
agcccacccL 
gat get gaga 
eggecaggaa 
caggacgcca 
aatgcctcaa 
ctccgggtca 
teatggggee 
gtgctccccg 
catcagctca 
aggggcacac 
geaagegaca 
ccgccatgta 

CCCCCtCCdL 

acacagaccc 
tgggcccgga 
tccctggggg 
ccaatctcag 
acaaacaaaa 



cgtgctcacg 
cacctgcttt 
ggggaaacag 
cataaactat 
ggtagggcag 
taacctcctt 
eggggtcaaa 
caggaccatg 
atgtccttgc 
aagctgatct 
ggtcagaggc 
cacactcacc 
ctftgtcctgt 
ygggcacaga 
acatgtgggt 
cgtggctgtc 
ttgaccagga 
agctaacatc 
aaaaaaaaaa 



gattttatta 
tccagcccac 
tccagagtcc 
gtatagegge 
ggaacctcag 
catagecaag 
ar.gacccaca 
ccactggccc 
ggtcgcaggt 
tgccacacag 
tgaaaagggg 
ttcctcttct 
gaagagttga 
cccddcacaa 

ggggggcacc 

gtcctcatca 
geeggtcaga 
cacacttccc 
aaaaaaa 



cacagatagt 
aaagggggac 
aacagcagaa 
egggggctte 
tagtcctcca 
ttggggatga , 
cgctgcagtg 
tgctccccca 
gatgecoctg 
gtagtcgggg 
cactgcacga 
catccacctg 
gtgctgtgct 
ggcggggatg 
ctcacgtgct 
ccctcgtggt 
gatggacctg 
cacatttcct 



ggcggcaatg 
gatcacggcc 
ctr.gggggaa 
tgcccagggc 
cccagccatt 
gctgttcctg 
acaagaaggg 
gccgcaggcc 
ggcgtgatgc 
atgt'ctcgct 
gcacctgcca 
agaaaaaagc 
tgggggagac 
ctcccacgcc 
tggcctcaat 
ttcgctggca 
gccagatgtc 
gettgecagt 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320' 
1380 
1440 
15 00 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1957 



<210> 99 

<211> 1112 

<212> DNA 

<213> Homo sapiens 



<400> 99 

ccacgcgtcc 

ccat tgaagc 

ccagcgctgc 

tatggtgctg 

ccccaggcac 

ggtggagcag 

ecttatgeua 

agatttcaga 

gaggcacgag 

cccaaggaga 

agccagtcgc 

geageggaaa 

tacagccccg 

ggctagagga 

tctcacaagc 

aggacccetc 

tgaaggctaa 

ttccccrttc 

ataaatccac 



gcccaccaac 
arattggaaa 
tccatcttcc 
gaaaatcagt 
t "cggccacg 
gtgatgeega 
aaacgtatga 
ctggcaacaa 
agcatcccac 
caggtggaac 
ccaagcctgt 
aggcccagct 
caagaggtgt 
acaccagcac 
tgccaaagaa 
caccctcaaa 
ccatttacat 
aaggaaaaaa 
agtaactaaa 



aaaggcactt 
aggaactggg 
agtgctgata 
tcatgtgctg 
ggatagaaga 
tttccattat 
gggtagaaga 
gagecctgaa 
ggtggtaccc 
cctgggggaa 
ctctggccaa 
caagtctgaa 
ggctggacca 
aaacgacagc 
cacatcccaa 
cgcgcatgtc 
cattctaaaa 
gtaatagact 
aaaaaaaaaa 



gcagttacat 
gaatttatca 
attatggcat 
agacatatag 
eggcaggagg 
aggggecaaa 
gagattttga 
gtgctccggg 
agtcataaat 
ggcacaccga 
gacacatcag 
gccgcaggca 
cgtggacagg 
ctcaagtctc 
gcagtaatca, 
cagagaagta 
atattttaag 
gtttccttac 
aa 



tcaccacatt 
aagcactcat 
tagccatcct 
gcggtcctga 
aaattgatta 
tgggccccac 
gagagagaga 
cattrgatgt 
cacctgtttt 
aagaaagcag 
ggaatacaga 
gcccagacce 
ateeggtcay 
cttcgagctt 
cagatgatga 
ttaatgeett 
ctgtattaca 
cgtgtttgaa 



tgtaaeggag 
gaaggaaatt 
gagtttctgc 
gagagaacct 
tagacc^gat 
cgagcaaggc 
tgtcgactcg 
accagacgca 
ggacacaaag 
•taczgaaagc 
aggztcaccc 
aggcagcaca 
cagcccccgt 
tgtaaagtaa 
agcaccttac 
aatagactac 
gtgctataaa 
aggaaaaaaa 



120 
180 
240 
300 
360 
420 
480 
540 
600 
. 660 
720 
780 
840 
900 
960 
1020 
1080 
1112 



<210> 100 

<211> 887 

<212> DNA 

<213> Kono sapiens 



<220> 

<221> SITE 

<222> (303) 

<223> n. equals a,t,g, or c 



<400> 100 
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ggcangagag 
ggrctgggta 
tatggactgc 
catgcacatg 
tctcactcta 
tcntttagaa 
ctctctttgt' 
tttcatgtaa 
tattctttgt 
> tctctgttag 
cattaaacat 
caccctttgg 
ctaaaacttc 
aggccgaggt 
aacccr.gtct 



aattataggc 

gtggctactt 

gcccctttct 

tcaaaatttc 

aagtcctatt 

atgkgctgta 

aatcacaaaa 

cagtatgtct 

aatgactcca , 

tgaacattta 

atcttttgac 

ctaga- tctt 

ctcaggaggc 

cjggcggatca 

ctactaaaaa 



gcatgatggy 
gtgtgtattc 
atatgtgtgt 
aaactatat t 
ccctctcttc 
aaagaacaa: 
atataaactc 
tagacatctt 
taaattattc 
ggatttttcc 
acatgtcctc 
tagcacataa 
tgggtgtggt 
cgaggtcagg 
tacaaaaactd 



gcttttggag 

actttatgct 

aatgcttcaa 

aaagagcgtg 

agtccatcat 

gatgtgtatg' 

actacatatg 

ttccr.attat 

tactgtgaaa 

agtttttaaa 

gcacacatgt 

cgtaaabatc 

ggctcacgcc 

agatcgagac 

aaaaaaaaaa 



actggcaatg 

aactcattga 

caaaagtgtc 

caar.taaaag 

tactaccagt 

tctctacagg 

ttattctacc 

tgcgcggaag 

acacaccatg- 

tattacagtg 

ataggtatga 

ccatagagtc. 

cgtaatccca 

catcctggcc 

actcgta 



ttctgttttg 

actgtacaga 

aatagtgatg - 

gaagtcatcc 

tcctagtaca 

tatatattca 

acatgctttt 

ctaccaacct 

Lctaaccagc 

acatattaa'a ' 

tggactttaa 

aaaaccaccc 

gcactttagg 

aacatggcga 



60 
120 
ISO 

.-240 
300 
360 
420- 

i 480 
540 

> 600 ■>; 
660 
720 
780 
340 
887 



<210> .1-01 
<211> 1248 
<212> DNA 

<213> Homo sapiens 



<400> 101 

ccacgcgtcc 

ggcagtgtcc 

gatatcaotg 

aaggatgacc 

ttctccacct 

aacatgcrgc 

tct tcagacg 

aggcccctgg 

acccccagct 

catgcaccct 

gctgaggctt 

acacacccag 

ggccactctg 

cctgccccat 

ctgccaggcc 

agtgctccaa 

acccttacta 

cccacccaca 

tgctcttctg 

ttatattcct 

aaaaaaaaaa 



ggtccattaa 
cctgtgagac 
accttggtac 
agccctcagc 
atagccagag 
gacggcagga 
actcatccag 
tgcagcagcc 
ctgggcccct 
acagtcggac 
cagtggaggc 
ctcccaccca 
ccacaagctc 
ctgcacacct 
ccactcacac 
gccccctcac 
ctacaggccc 
ctacgccaag 
tatgtgacaa 
c taaaactct 
aaaaaaaaaa 



ggtgacagaa 
caaggacctg 
catcaagctc 
tgcttcttct 
ccnaccggac 
ggagctggag 
cccacagccc 
cgagcccctt 
ggatgaggag 
t.ctgagccac 
cgtcggccca 
tggaaagcac 
taccctcggt 
agactcagtt 
cactaca'ggc 
tcacactact 
taccctcaac 
cccctcttcc 
catttttgta 
caaaaaaaaa 
aaaaadddaa 



ccgaagggcc 
tttgccgccc 
agcctggaag 
gtcaacaagg 
acaccctcac 
aatgggacag 
-caggcactg 
cccatccaag 
ggggccgtgg 
atcagtgagg 
aaaagcctat 
cccagtcctg 
acaacaggct 
cataagtcca 
tctacctata 
acaggctcca 
atcataggcc 
cacagcaata 
cattatgtta 
aaaaaaaaaa 
aaaaaaaaaa 



t tggccaacca 
tgccccaggt 
tcacatggag 
cctccacagt 
ttcgggaaca 
cacggtccct 
cccgccactc 
ttgccctccg 
ccccagtcct 
ctagtgtaaa 
cccggggacc 
ttcctcctgc 
ctgtccccac 
cagactctgg 
gtgccattac 
cccacaagcc 
cagtccagac 
gtccccaata 
ttggaatttt 
aaaaaaaaaa 
aaaaaaaa 



tgtggttgtg 

tgtggctgtg' 

ccccttcgac 

caccaagcgc 

ggc-ttctat 

gtcatctgaa 

accagcccct 

caggcctgag 

gycaaatggg 

Lgctgccttg 

tagcccacct 

cctggaccct 

atctacagac 

cccttcagaa 

cactacccac 

cataatccct 

taccacaagc 

cgtagatttt 

ctt tcataca 

aaaaaaaaaa 



60 
r 120 
180 
240 
300 
360 
420 
480 
' " 540 
600 
660 
720. 
780* 
840 
900 ' 
960 
1020 
1080 
1140 
1200 
1248 



<210> 102 
,<211> 1841 
<212> DNA 

<213> Homo sapiens 



<400> 102 

ggcacgagct 

agggcaacga 

ctgatcatct 

ccagtatttc 

caaaaggtac 

taattgtgaa 

gtggactata 



Sfgtgcaggag 
gcacaacagg 
tctgcaaatc 
aagagtcact 
cttaatttga 
gtaatgtaag 
oaatLt:tccg 



ctggagcagt 
agccacgagg 
tgccagcgca 
tcttr-.acctg 
agaagtgttc 
catgtggaag 
catntagcat 



accagttgtt 
agttggtctr. 
tcggtcctat 
gtgcaggaag 
tgctttgtat 
aaaggtga-a 
agaacattga 



gccgaagaga 
gtccaataag 
gtbggataaa 
gcagtctccg 
taatagccta 
atgtaactat 
attttattac 



tcggactggg 

caCgtggc tc . 

gaaattccac 

ctacgtacag 

ataattatta 

aaatcatgct 

ataagtgttt 



60 
. 120 
, 180 
^40 
300 
' '360 
420' 
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aatgggrcc- tctgcattaa aaaagttgat aaagatatag atgttaagct aaaaagcttg 480 

aatttattct gagatgttgt ttgaggtttt atcaatgtga gttcttaatt ttccattttg 540 

cagctcactc ataacttcat tgaatgtaafc tgtatgtgtc taataaactc ; tt taaatgct 600' 

agaggtattt catttgotar ttagcttctg aaagcccttc gtt&taaagg' gaagtgattt ' 660 

-ctaaatttg gtagcagctt gaaaacaagc accacaaatg taaaartgca -ggttctttt 720 

cttactaatt taatrttttt gccaagtcat gtgctagcaa aatgtttgaa attgctattc 780 

tttttctgtt agcttggtat ar.atgactr.a gactaagtat atgtatgtga tataatatat 840 ' 

gtgtacatgt ggctaaactg ctaatgggca aagattgttg gttcttctga tgctgagcat . 900 

acatcaagtt catcccatga ttactLtcca ttacgagcta ttgctgatgt gatgctactc * 960 

ctagctgcac caccagtaca aaccttaact gagaaagaac acccctccct cttagtLCcg 1020 

tgccagactg gtttgttagc tttagttgtt tcttagtgct ctagccatgt caccctgtgg 1080 

cactaatcag gacactgt tt i atattcttcc tcttcngtgt ccctattcac tttgtaatta 1140 

ttctatttta gctggttata tggtagttaa ttggtagcct ^gttttccttt gttttcttta . 1200 

cttgtgtcaa gctcccitguc gtLyLtutyL gatgagcagt gatttagaac tggccagacc 1260 - 

ttagaagagt cttcagcaag ctgccctagg gtaggggcat attccctgct tgttctgtac , ; 1320 

tggtacccca ggaggatgcc ttqtaactgc tccctggtca tgatagtggt aatgaatcca, 1380 

tticatgtaaa atgcaatatg . tctgc tagtt cggggcaccc gaggctcagt agaggtttag. 1440 

tatgatgcct tccctttcct ctttgagtra tgaagaaact aatttctggc acagtgtgac 1500 

tgctccaaac ctatgttctg ccgc^atctc ccatagagga cggcagactt ttttttctta 1560 

gtgcatttgc cggtaggtca aggagggatg gactttgaat tagtg-tcag agtcacagat " 1620 

cctacgtatg tgctcttcag taqaqqattt , tctgtgatcc tacaatgaag ggaaagctat , 16B0 

ataatttart ggtactrtcct aaagatagag actaaagtca. tggtagratt ggccaaatta * 1740 

attttaaaac tgaacaagtc atgaattttc atataaatca aagaactttt tccttataaa 1800 

agcttccactf "c tcgaciaagc tigaaaaaaaa aaaaaaaaaa a 1841 



<210> 103 

<211> 685 . , . . ' 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (678) 

<223> n equals a , - ,g, or c 
<220> 

<221> SITE 
<222> (679) . 

<223> n equals a,t,g, or c 



<400> 103 . 

actacggctg cgagaagacg acagaagggg ggcggcgacg gaggaggagg a'-ggaggcgg 60 

tggtgttcgt cttctctctc ctcgactgtc gcgcgctcat cttcctctcg gtctacttca ' 120 

taattacatt gtctgattta gaatgcgaCt acattaacgc tagatcatgt .tgctcaaaat - 180 

taaacaagtg gctaattcca. gaattgattg gccataccat tgtcactgta ttactgctca .240 

tgtnattgca ccggttc'atc ttccttctca actcacccgt cgccacrrgg aafar.ar.atc 1 / 100, ;, 

gatacattat ggtgccgagt ggtaacatgg gagtgttiga tccaacagoa atacacaatc 360 

gagggcagct gaagtcacac atgaaagaag ccatgatcaa gcttggtttc. cacttgctct 420 

gcttcttcat gtat'ctttat >agtatgatct tagctttgat aaatgaccga agctggagaa ( 480 * 

gccgtggttg a age. eager t. acactacagt gcacagt-ga ggagccagag acttcttaaa 540 

tcatccctag aaccgtgacc atagcag&at atattttcct cttggaacaa aaaactattt 600 

ttgccgtatt zttaccatat aaagtattta aaaaacazga aaaaaaaaaa aaaaaaaaaa 660 

aaaaaaaaaa aaaaaaanna aaaaa * ' . 685 



<210> 104 
<211>- 1168 
<212> DNA 
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<213> Homo sapiens 



<40O> 104 1 - 

ggcacgagcc cccgccccct ccctcccxcc cczccc.t.tr.c ccggccccgg ccctggcccc 60 

ggcccattcg ccgttgggtc ctctgctagg gagga-gtcg gg.tcgtcgc tgcccagcgc 120 

cctggccctc. tcgctgttgc tggtctctgg ctcccccctc ccagggccag gcgccgctca 180 

gaacgtgaga gtacaatctc gacaggatca gaagtagaga atgaagttgt aagagaaagg 240 

gaaagacaga agaaaggctq cagLagtaca aggagaaaag cagga-gcaa gaar.gaggaa 300 

tgaatcttrg tr.tgnggagc atccggaaaa acacaagctg tcagaaagag taaatagacc 3 60 

agggacctct aaagtaaatt cacacatcaa agttaaaata atgttggaga aLcacccccc 420 

gtgaaaaaLL yyeLlagctt . tcagtatgcc - tctr.taaaca- aaacac tatc attttataca , 480 

aa-ttttaaa atgttgttca taatatggag tttaatttta agcaccttag ataatgcatt 540 , 

tgctagcttg tttgtgaaat ttttatggat tttttt'tttc aaaattccca attttagttg 600, 

gtaaggacta acrtcgggaa gacaggaaac ccctccagta aaaczaattg gttataaatg 660 

gttacatatt. ■ ttnggcttat atacatdaca aak'- l Li: Laa ctgaatttaa aagtgatctc 720 

gtgcaaaaac attatrctag tgatattgat gtctaaattt aaaagagtga atacacaagt 78C 

aaaatatatc T.'gcttzCtaa ggatctattc agtttaggaa ggagacjtcaa aatttgtatt. 340 . 

taatttcaat ttat tatat t,- gcctgaaact gaaagtaggc ccaacttttg tctgcttttg 900 

tgtatctaca aaggcaaaca tttatcaatc nactct-att aatttgagat tattLLydcc .960 

tgactcccca yaactttcgt ctatctgtat agaaatgggg ttttctaaat atttaaagaa 1020 

tttcctacgt atgtaaattt acttgcacac agtaagttag gggagacatt taatattcct 1080 

caqatacctg ctgttttct t 1 gttgtgt ttc tzgtttttca aataaataaa ctgagtgtta 1140 

tctgttcatt aaaatagaat gtagtcag ' 1168 



<210> 10b 

<211> 1175 

<212> DMA' 

<213> Homo 'sapiens 

<220> 

<221> SITS 
<222n (24) 

<223> r. equals- a, t, g f or c 



<400> 105 ' • 

gcatggaaat atzatcctata catnaagtgc tgtcccgagc tgtacatwta gccatattta ; 60 

grattttatt argcaggatt' caaggcctcc tanaaatgga tattcagatai l.t_Laaaaaza 120 

cacta'ttaag gaatgcaatg taaaancaca ctaacatttg catagcacct tatagcttac 180 

aaaatgcctt cacatatacg agctcattta tttctccttc tcttcttt'ct attcatctat' 240 

caaggaatat cttctataag ccaggcatca ggcttgacac tgaagacaca aaatgaaaaa ,. 300 

gatatacaag tctctattct taaagagttc gtagtttaat gaagtgaccc aaaaagtaaa 360 

. cagttgcaat ataacatgca aaggagtgLg atgaaataca cctagagtac tatgaaaatg 420 

gcatagaggg accaagcctg taatgaagtc agagaaagta aaagaggtga agcckgagct . 480 

aagttttaaa ggatgaacaa aaattgacca gacaaaggag ttggagaagt gggacggtar ' 540 

agga-aaaag cctr.rctaat ttggggaagt gtgagtcgzc gatattagat gatattatta 600 

atagattgtg cacagagtgc acatggtaga gaggtgagag atgaggtagc actgttaggg 660 

cagauaygLy aattctcaca gtaaccctgt gwtgtacmtt atcatcctcc tctccatta^ 720 

gtaggggaga acaccaaagc ccaaagagat taagtaactt' 'tccagaattr tgatttcaaa 780 

tcctaagcgt ttggtagctg tgtttccctg catar.agr.ct gcctcaaaag aaaatttaaa, 840 

gttgtatttt gtattaaatt taaacaaata raggccgggt gcagtggctc ' acgcctgtag, \ 900 

.tcccagcact ttggtagyct; aaggtgggcg gatcacctga ggtcaggatc tcgagaccag " 960 

cctggccaat atggtgaaac cccatctcta ctaacataca aaaattagct gggcatggtc • 1020 

gcacatgcct gtaatcctgg ctacttggga aqctgaggca ggagaatctc tt.gancccag 1080 

' gaggcggagg ttgcactgag "ccgagattgt gccactgcac cacaacctgg acaacagagc ' / 1140 

aagatcccgt ctcaaaaaaa aaaaetaaaaa ctcga - 1175 



<210> 106 
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<211> 1021 

.<212> DNA 

<213> Homo sapiens 



<400> 106. 

ggcacgagaa taaaatagat ttagggtatt cgacacattt aatacataat gaacag^goa 60 

cttcttgaac aatataatgc ccaaacgcga ccgatatgca aaacatttaa atcccttaca 120 
ggatgtgcat ttaccaatca gaacacatgt tggctttact gctigtatca gttttctgta , 180 

ttagtttgct ' tgtatcagtL tgctggggca gccataacaa agtaccacag aaatttatat 1 240 

tctcacaatt ctgggggcta gaagactgag ataaagatgt ctgcagggtt ,ggtttcttct 300 

aaggactctc ctgttggctt gtagatggct gttgtctccc tgtg.cttca cctgcacttc , 360 
cctctgtacc igcccgtatc tcaatccct t ctcacaagga cagagrcata ttggattagg . 420 

gccccaccct aacgacctcc ttttaactta attacctctt tagagactct gtcttcaaat '480 

acaggcacat tctgagqtac tttaaggtta gaacttcaac atacgcattt ggaactgggg 540 

cataactgag tccatgacac tattggagag ggttctggag ccacaacaga tggaatgaat ; 600 

gggccttgtc tggattctaa tttaaataaa ccaactatag aaaaacatgt tttaggcaat 660 

cagatt'tttt gtxatgggct aaatacccag aagatgtcaa gaaatgttta ttaatttcat 720 

taggtaaaat aatgttggtg tggttatgta gagtatgact tacgtgcaca gaacagaatg 780 
ataaggtgtc caggatttgc tttaaaatat tctagatztt ttcaaaaggt gagaggaaat , 840 

gacacat-.gta cacctaaatt ttgatggttg tcgaagccgg ctggtaggta cctagtattt 900 

cattaiactt ttcccnttgt gtgtttggaa atttttataa taagcaggaa aaataaaagt 960 

aagaacatat gtactataga taatccgaaa tatgcagaat catgaaaaaa aaaaaaaaaa 1020; 

a 1 1021. 



<210> 107 
<211> B30 . 
<212> DNA 

<213> ( Homo . sapiens 



<400> 107 ' - , s " - ■ 

ggcacgagta aggactgtgt tctttacgca tttcttgatc caggcatggc agttcctctt 60 

ttcctgtaca taztcacact cctgccactt ctaccctttc tcttatccct ctgcttttca 120 

cctctgactg taaaaaqaag tagcagttcc gaaagcaaga gttccctatg aacacggaag 180 
aagacattgg caacttttga gtacaacaac tatatttaat agagtaattt aagaacatca , 240* 

gccagtgaac tttatacaag atagtgaaag agaaaaggaa gattaattag gggtagttta 300 

ggatgccact aaacagccca gaattagggg agtagtcgtt gaatagaaag gaggccacaa 360 
acttgaggga tataagctaa gaattggtaa gccaagaaga aggaaaaggt ttgggcagta " 420 

aggataa-ga ggaacaaaat agagaactca gaagcaatat ctgactgtca tcattggaag 480 

aatttttttg cctgctcgag gctggatat-.t gaagtggacc aggatacttg agcgactatc 540 

tgatgggctt ttggaactag ctctcaagag gtgaaaatca gctttttttc ctct~tcutt 600 

cttttttttt ztttttgagg caaggcctca ctgttgttga- ggctgaaccc cctgggctca 660 . 

agcagctgtc ccattgcagc ctcctcagat actctgtaag ccaaggcagg gggaatattt 720 
tgtgctcagr agtctgaggc tgtggtgagc -aagatcaca ctgctgtgct cacctcagcc < 780 
tgggcaacac agtgaaaccc cgtctccatc tgtttaaaaa aaaaaaaaaa ' . 83 0 



<210>- 10H 
<211> 1301 
<212> DNA 
. <213> Homo sapiens 



<400> 108 • , t 

aagagggtgg crgccgagtt rttgtttgtc ctgcgctctig agagtggtga gttatggaga 60 

cccaggtccc agcccacccc acctcagcca ggcttgcaca ctgacc-ccg ccttgcccct 120 

cagtgccccg atitcaLcaag cacacaggct .atgggaatgc. tgctggcctt ctggctgcca 180 

ggggccccac. ggcaggaggc cggcccgagg gccagtacic* agaggatgag gacacagaca 240 

cagatgacta caaggaagcc aaagccagca taaaccctgc gaccgggagg gtggaggaga . 300 

agccgcct.aa ccctacggag ggcatgacag aggogcagaa ggagcacgag gccatgaagc 360 
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tggtgaccat gtttgacaag ctctccagga 'acagagtcat ccagccaatg gggetgagtc 420 
cccggggtca -cttacgtcc ctgcaggatg- ccatgtgcga gactatggag cagcagctct 480 . 

cctcggaccc tgactcggac cc-gactgag gatggcagct cttctgctcc cccatcagga 540 
ctggtgctgc ttccagagac ttccttgggg ttgcaacctg gggaagccac atcccactgg 600 
atccacaccc gcccccactt ctccatctta gaaacccctc ctctcgactc ccgttctgtt 660 
catgatttgc ccctggtcca gcttctcatc tctggactgc aacggtcttc ctgtgctaga 72 0 

actcaggctc agcctcgaah tccacagacg aagtactttc ttttgtctgc gccaagagga 780 
atgcgt-cag aagctgctgc ctgagggcag ggcctacccg ggcacacaga agagcatatg 840 
ggagggcagg ggtttgggtg tgggtgcaca caaagcaagc accatctggg attggcacac 900 ' . 

tggcagagcc agtgzgttgg gg't.atgtgct gcacttccca gggagaaaac ctgtcagaac 960 

• tttccatacg, agtatatcag aacacaccct tccaaggtat gtatgctctg ctgttcctgt /. 1020 

cctgtcttca ctgagcgcag ggctggaggc cccttagaca' ttctccttgg. tcc-cgzcca 1080 

"gctgcccact gtagtatcca cagtgcccga gttctcgctg gttttggcaa tcaaacctcc 1140 

ttcctactgg tttagactac acttacaaca aggaaaacgc ccc-cgtgtg accatagatt 1200 

gagatttata ccacatacca cacatagcca cagaaacatc atcttgaaat aaagaagagt 1260 

- ttggacaaa aaaaaaaaaa aaaaaaaaaa aaaaaactcg a 1301 



<210> 109 

<211> 1932 

<212> DMA 

<213> Homo sapiens 



<400> 109 . ' , > 

aatttttttt. ttttitt tttt tttttttgaa aaaaaaaatg ggtagtgtat attctgcagg 60 

tttaagacaa ctcaggacaa taaaaacaat ggactttaca tgtgtatata Latagctctc 120 

tcaggcacca taatcagtat gagccaacaa cattcaaact tgattcaggc cacattcaga 180 

catttgctct tatatacaaa tatttaaatt aaatacaatc tgaaacgtgt tctgttacat 240 

acaaaaaagg aaaaactata caacgcagag cagtgtgtgt gttttaaata attacattta 300 

catgcaagct aaatggaacc agcaatggtg ctcaagttct tatcatccct tccagaaaat 360 

cttrttctac catctcccct attttttgcc tggctttgct ggaacatggz ttgcggtLct 420 

ccagtttcat gtccttatta gggaaggcat Ltgagtagag gataggactc cctgagtgtc 480 

ccccacatcg gcttgtgact ttgctgttga agacttgact gagcacattg aagaacggca .540 

ggagctgctc catactgcgc acggtgcaga tggtgagcag caagtgccct ggctcccaac, 600 

ccaatgttct ccctgagttg tc-tcctctg gatttttctg cagaaaacaa aaagtgaact 660 

ggtattaata caacagacaa tgtggtatgt ragaaaaatt aaaaatatat aaaccttggc' . 720 ' 

aattggtcaa gaaa;gaa:a caaatgacat caagtttcta actcctgacc tgatcaaaac / 780 

ccttggtgcc cctgagacct tttactgcca cttattagtt ttacatggag cagtctaaca 840 

ttgtagtaat agttcccaac cagaatgcgc agataagctt agttaacaga aatagctttg 900 

aacaggaata gagtcaaaca taaaagtttt argttgtgct ttgtatttac tcaaaaagct ' 960 

cccaggtttc tgaaccctca ctactgtaac caaggaccag gtcacaaaat tactacagaa 1020 

aaaaggaaca-aagtgcttca cacatctcat aatatatccc cttttattat aattagttaa 1080 

tcccctttta tctaaatggc ctaaatttgc catga'tggta gcagtgtcca aagtgaataa 1140 

ttactgtcag. tactgcatca cagagaaagg aagggatccc tcaggagaca ctgctgrctc f 1200, 

cttctgggtt gtgctaaaca acatagggag gaaagctgga cctggagtca aaggaa-tga ■■ / 1260 

gttagtgtgc tggctctgcc atacttacgg cacccttggg caggaLdtac daeiijy Ltcct 1320 

cacttazaaa atgggacagt ctaaaaccac cttttagtag agaagtcaaa tgagaaggta 1380 
tgtgaaaacc ctgtcaacta aatataaaga ctaataattt gggtattaag aggctagttt. . ■ ." 1440 

gagaagccac ctgaattaca caaacacagc tacagacatc attctg-cta gagaaagata 1500 

agagagaaca ggttggttga acttgggcag aatcacagat acaattccac accaaagaat 1560 

gaaaataagc aatgaaztag acagaaygaa gaaatcatga agacccagga agcagaatta 1620 

caatct-gtca cattaacaaa tggagtttgc cttctaagat cagaegtcgc tcagaaactt . 1680 

tcatcgtt-a cctaataact uaatatcacc agtttcctag tgggtcaagc agatgcaaaa 1740 

tccagcttat cttcttcr.ac gcgc-ctcaa gcttattgcc tattttaaag taaaatcctq 1800 
aaaaaggaaa atactagctt ggtgcaaacg taattgcggc ttttgcattg tcgaaatttg " 1860 

ccgttttota ttggagLaca ccctcaaata aatgtggtca tgttatacaa aaaaaaaaaa 1920 

aaaaaactcg ag ' ( ' . . 1932 
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<210> 110 

<211> 1534 

<212> DNA ; ■ 

<2I3> Hono sapiens 

<220> ■ 

,<221> SITE 

<222> (1212) 

<223> n equals 6,t;g, or c 



<400> 110 ' r 

tcgaagatag gttcgagacg gtggatgttg cagctgatca tgcagttggg ttcgg-gctg 60 

ctcacacgct gcccctttLg gggctgcttc .agccagctca tgctgtacgc tgagagggct 120 

1 gaggcacgcc ggaagcccga catcccagtg ccttacctgt atttcgacat • gggggcagcc 180 

gtgctgtgcg ctagtttcat gtcctttggc gtgaagcggc gctggtxcgc gctgggggcc 240 

gcactccaat tggccattag cacctacgcc gcctacatcg ggggctacgt ccaccacggg 300 

' gactggctga aggtccgtat gtactcgcgc acagrtgcca tcatcggcgg ctttcttgtg 360 

ctggccagcg gtgctgggga gctgtaccgc cggaaacctc gcagccgctc cctgcagtcc - 420 

' accggccagg tgttcctggg tatctacctc atctgtgtgg cctactcact gcagcacagc -480 

aaggaggacc ggctggcgta tctgaaccar. cr.cccaggag gggagctgat gatccagctg 540 

ttcttcgcgc rgtatggcat cctggcccct ggcctttctg tcaggctact acgtgaccct 500 

cgctgcccag atcctggctg taccgctgcc ccctgtcatg ctgctcattg atggcaatgt 660\ 

tgcttactgg cacaacacgc ggcgtgttga gttctggaac cagatgaagc tccttggaga 720 

gagtgtgggc atcttcggaa ctgctgtcat cctggccact gatggctgag ttttatggca • 780 

agaggctgag atgggsacag ggagccactg agggtcaccc tgccttcctc cttgctggcc 840 

■ cagctgctgt ttatttatgc tttttggtct gtttgcttga tcttztgctt tcttaaaatt 900 

gttttttgca gttaagaggc agctcatttg ' tccaaatttc tgggcctcay cgct-Cyggag ' 960 

ggcaggaacc ctggcaccaa tgctgtacaa ggttttcttc ctgttaggaa gaactcgagg 1020 

ccagctgccc accgagtcct ctgtccctga agaaagggag tattgggcag ggcttgggat 1080 

ccggctactg agagtgggag agtgggagac agaggaagga agatggagat tggaagtgag 1140 

caaatgtgaa aaattcctct ttgaacctgg cagacgcagc . taaactctgc agtagtgttt 1200 

ggagactgtg anagggagtg tgtgtgttga cacatg^gga tcaggcccag gaagggcaca 1260 

,'ggggctgagc actacagaag tcacatgggt tctcagggta tgccaggggc agaaacagta i320 

ccggctctct gtcactcacc ttgagagtac agcagaccc- gttctgctct gggctgtgaa ; 1380 

ggggtggagc agqcagcggc cagctttgcc cctcccgctg tctctgtf.r.c tagrtceatg "1440 

gttggcctgg tgggggtgga gttccctccc 1 aaacaccaga ccacacagtc ctccaaaaat - 1500.. 

aaacatttta tatagacaaa aaaaaaaaaa aaaa 1534 



<210> 111- 
<211> 287,1 
<212> DNA 

<213>'Homo sapiens 
<220> 

<221> SITE 
<222> (1234) 

<223> n equals a,t,g, or c 

<220>. 
<221> SITE 
<222> (1259) 

<223> n equals a, eg. -or c 

<220> , 

<221> SITE 

<222> (1283) 

<223> n equal's a.c.g, or c 
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<220> • ■ - , 

<221> SITE : ■' 
<222> (1284) ■ 

<223> n equals a , : , g , "or c 
<220> 

<221> SITE 
<222> (1297! 

<223> r. equals a, t, g, or c ■ 

<220> ' • • • 

<221> SITE 1 
<222> (1378) 

<223> n equals a,t, g, or c 
<22 0> 

<221> SITE , ' 
<222> (1912) 

<223> n equals a, t,g, or c> 
<220> 

<221> SITE 
<222> (1913) 

<223> n equals a,t,g, or c 

<220> . 
<221> SITE ; 
<222> (1935) 

<223> n equals. a, t,g, or c 
<220> 

<221> SITE 
<222> (1947) 

<223> n equals a,t,g, or c 



<400> 111 ■ - 

gcggcgaggc ggcggtggtg gctgagtccg Lggtggcaga ggcgaaggcg acagctccag 60 

gggttggcac cggccccgag aggaggatgc gggtccggat agggctgacg. CLgctgctgt . 120 
gtgcggcgct gcngagcttg- gcctcggcgt cctcggatga agaaggcagc caggatgaat " 180 

ccttaggatt ccaagactac tttgacatca gatgagtcag taaaggacca tactactgca .240 

ggcagagtag ztgctggtca aatatttctt gattcagaag aatctgaatt agaatcctct 300 

attcaagaag aggaagacag cctcaagagc caagaggggg aaagtgtcac agaagatatc. • 360 ■. 

agctttctag agtctccaaa tccagaaaac aaggactacg aagagccaaa gaaagtacgg 420 

aaaccaggta gtctggacat tttccttgct ctxtgattta tttaggggac .aactgaaaat 480 

tttaagctaa tgaataaaga agctcaagaa gactggcttc actgattatt acccacaaat 540, 

aatar.atgga gtgtagtttg gagagaattt c-agatttta atataccaaa gttactcacL 600 

taacagattt gacccaagtL acanaagcrg aatcccatag atgagatrgc atttatcttt 660 

ttaLtaatat ccLgacgttt cttttuggaa tcaagtatgt aaataccttt ctcactgaga 720 

aatttaaaaa aataatatgt cagggtagca cacaaaacag aaactcagga agaaccgtct 780 

aacatgccac tacggtcatt ztcaggattg cggtaggatt attactacaa taaccttgtc B40 ■ 

gatcatccta ggtaatttga tttoggtgtt tatCL-ctcca gcttgacgct: tgtgtgtact 900 

tccatcctat uctaacct^c acaarcctga aacctctact tctcaattra aaacttgacc 960 

ttttccacaa tagtcttc-a aatatcaatc c:tttaaaaa tcattaatgt acgcattcta 1020 

ccatgtcttg ttgcccttct gtcattr.a tt atcccaatga tcactgcctt atctgtttat 1080 t - 

ta-tccatca v ctaaccactc tcaggaactg a tccaaatcg- gaaccttt ta aaaaaataga 1140 

attttttttt actocaoata taccaL lclc , a^gaatacaa aactactttt: aaaaaattaa 1200 

aagnacctga atcccatcaa ctagaaataa =agntgctca catcatggag aatattctnt 1260 

caggatttt^ ttaggtgtat car.nagnttt aagaaaagta aattgggtca ' aaatgtgtat 1320 

tctatgctgc agcctgcatt. tr.gcac:t.t.Ha cagcttacca ' tgaatgttt t tccacgtnat 1380 
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ttagc-tact. r.tcaacttga tagttaatgg ctataaaaaa ttttatttgt- aaatacatgg 1440 

taccacaaac caaaacgctt atgttttgct gggagatcat tttagatgta tctttgtgcc 1500 

tatctgtgaa aatttcctta gaattcttaa agtaaatttg ctgaattgga cgtaaaagct 1560 

caaaagacag tagatacaca ttgttaaatt gttcttcaga aaagtgattc tgtttactLC ,. 1620 

ctctgaaaat acctctctcc ctgtacctca cccaagtgtg tattaccatt ttaaaatttt .1680 

tactaaatat aatttaattc attcttcatt ttttaattag ggattttttt ctztttcttt. 1740 

tagtttetat actttcagag acatgg^ctc actctgtcac tcaggctgga gtgcagtggc 1800 
acgaLcatag .ttcactgcag ccrtgaactc aggtgatctc cctctttggc ctcccaaagt . I860 

• gctgggatta caggcatgag acactacatc aagcctggga aat"tttaaa cnngcataaa 1920 

aagtagaggg- taaantpcca ccccttncca .accctcactc agct-caata aacatcaaca ' 1980 

ttctgctggt cttttcatct ■ ttactaccc^ acacacattt gccctttttt tctttcctag 2040 

aatattttaa ggcaaattcc gcatgtgctt ttacatctct gtgccccccc ctagLaagaa 2100 

tgctcttttc agcaaaatca cagaattagc cattactcct ttattaccta atacccagac ' 2160 

■-tgtttaattt ttctagttgc ttcaagtgtc tgtttacaat tcatttgttt gaattcactt .2220. 

ctaagctact caggaggctg ' agtcaggaga attgcttgaa ccctagaggc agaggttgca 2280 

gtgagctgag atcgtgccac tgtactccag cctgggtgac aaag'tgagac tccaegtcnt' 23 40 

aacaaaacaa aacaoaacaa aacatgacga aacaaacaac, acaa-.tgaat aatatccctc . 2400 

agtcattatt tagagaagag atgacccaat ataaatgtaa tatgtgatcc atttggtcac 2460 t 

agaatgaatt tcagtcattg . cttaggtaag cctggtaaaa aaaaaaaaaa cgggattcca 2520 ; 

tgattagtgt gttgaaaaag gggtrtgcrcc aaagtgtaac agacagagcc gggtgcagtg 2580 

gctcacgcct gtaatcccag aactttggga ggccaaggca ggcggatcac gaggtcaaga 2640 

gatcgagaca atcctggcca acatggtgaa gccctgtatc aaccaaaaac acaaaaacaa 2700 

ttagctgggc gtggcgaca- gtgcctgtag tcccagctac tcaggaggct gaggcaggag 2760 

'"aattgcttga acctgggagg cggaggctgc agtgagctga gattgctcca ctqcactcca . 2820t 

gcrctggccac agagtgagat tccar.ctcaa aaaaaaaaaa aaaaaaaaaa. a 2871 



<210> 112 ' \ . . 

<211> 1037 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE ■' 
<222> (936) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE 

<222> (9461 ' 
<223> n equals a,t,g, or c 

<220> 

<221> SITE , 
<222> (951} 

<223> n equals a,t,g, ore 



<400> 112 

ccggtccgga af.cccgggt cgacccacgc gtcegggegg gcgcaggacg tgcactargg 1 60 

ctcggggctc gctgcgccgg ttgetgegge tcctcgtgct ggggctctgg ctggcgttgc. 120 

tgcgctccgt ggceggggag caagcgccag gcaccgc=cc ctgctcccgc ggcagctcct .180 

ggagcgcgga uctggacaag tgeaeggact gcagcac-tc ccgccccctt ccggccgctt -240 

tcgcccatcc ttgggggcgc tetgagectg accrtcgtgc tggggctget ttctggcttt 300 

ctggtctgga gaegatgecg cagga'gagag aagttcacca crc era tag a ggagacegge 360 

ggagagggct. gcccagctg- ggegctgate cagtgacaot gtgccccccg ccagccgggg 420- 

ctcgcccact 'cotca'trca: ica-ccaLtc Lagagecdyt. cLutgcctcu cayacgcggc 480 

gggagccaag ctcctccaac cacaaggggg gtggggggcg gtgaatcacc tetgaggect . 540 

gggcccaggg ttcaggggaa ccttccaagg t'gtctgg- tg ccctgcctct ggccccagaa 600 
cagaaaggga gcctcacgct ggctcacaca aaacagrr.ya enctgact «ri ggaactgcag , s 660 
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catctgcaca ggggaagggg grgccctcct tcctagaggc cctgggggcc aggctgactt 720 

ggggggcaga cttgacacta ggccccactc acccagatgt. cctgaaattc caccacgggg 780 

gtcaccctgg ggggttaggg acctattttt aacacca'ggg ggctggccca ctaggagggc 840 

tzggccctaag atacagaccc ccrccaactcc ccaaagcggg ggggggatat ttattttggg 900' 

gagagtttgg aggggagggg gatttttttt ttaaangatt .ttttantttt naaaaaaaaa 960 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaaaaa 1037 



<210> 113 

<211> 2214 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE' . 

<222> (289) 

<22 3> n equals a,t,g, or c 



<400> 113 

cggacgcgtg ggtcgaccca cgcgtccggg aaaaarggaa aaratgccgt gtaaaatctc 60 

gttctgtgtc tgaattgccg taggctcaga tcttcatttg aggttctgtg tctgaattgc . 120 

cgtaggctca gatcttcatt tgaggttatg ttccataagt taacgttgat ctcgtgtgag 180 

ctttcggtag ctggagtaac acaggcggcc tcacagcgac ctctccagcg ccttccaagg 240 

cacatctgca gccagcgtant tcctcctggg agatgcctcc tcaaggccnt gctccagacc 300 

acgtgggrar ggcctgacaa gccaattccc aggctgtccc cacccttgra gagtgaccct 360 

aaacgctaga cagatgggcjc atgggaaaga aaagaaagct* gcagaccrca agttaaaatt 420 

ccctcaaaaa cgctccta^t tarctgcttt ttctga'aagg ataaaggctt tttgaaaatt . 480 

attttctaac aaataaca:g aacacttcta gaaaccctag aaaaacacaa agtattcaaa 540 

atagaaagaa aaattaccca ttactcttta agccagcatt atccattgcg gtgcttttgg 600 

agttgggtga ggccgtagcc tctgccaagt caaggagccc ggtggtggct gtggcattcc 660 

tgcagggttg tttttttttc tttgagatgg agtctcactc ttgtcacccc agctggaatg 720 

t tggtggtgta .aacagctcac tgcagccttg accctgaggc tcaagcgatc crtctgcctt ' 780 

ggccrcctga gtagctggga tcccaggcga gagtcaccac accctgtcca tgttcctgca 840 

ggtcttgata tgcgaggarg ctgtgtcttc cctgccacat t^tcttcttc tttcttgaga 900 

cagacccttg ccccatcacc caggccagac tgtg'gtsgtg cgaacacggc tcacrgcagc 960 

ctcgaccctc aggctcaagc gatcctcacg cctcggaccc ccaaagtgct gggaccacag 1020 

gcgagagtca- ccatgctggc ctgaatcttc agggtatttr cggttgargt gycacttact 1080 

carccatscc tgtttcaaga gtgtaggtgg zcaccctgtc tctgccgctg acctggcctg 1140 
gaccctcggc ' tgtgagaggg aggggtgggc tgggctggag gaacctraag ccctcgtgat * 1200 

grcacaagcc catccggctg ggcatcccct gctgtgtcct gagctgcaca tgccccaggt 1260 

ggcccccaca. gcagaggcga gccactgrag ggtgragggc ttccacggac ggtctccagg 1320 

ggragaagaa gggcccaggc ccccaggaga ctcaggagac cagagcctgg ggtcaggggc 1380 

tmagcagggg ctyarccagg gciggatgtc cggagccagc cccgmagccc tgkgktcrtt 1440 

gttcttcgca ctcccaccgt ccgtgtgaac agctccagcc ccacctgcgc ctccctgcgc 1500 

tgggctccat cagggagccc agaagacgtg tgtgcttctg aaattgggtc cctacatgcc 1560 

tttgtcccag- tgcaccttgc tccttccatt cactatcgag atttaaatgc ctgttttctc 1620. 

cccagaggtt gacggatata ttcagacgtt acgacacgga tc'aggacggc tggattcagg 1680 

tgtcgtacga acag^accrg tccatggtct tcagtar.cgt atgaccctgg cctctcgtga 1740 

agagcagcac ancatggaaa gagccaaaat gtcacagttc ctatctgcga gggaatggag 1800 

cacaggcgca' gttagatgct gttcttcct t , tagaLL t tgt cacgcgggga cccagctgta . I860 

Vcatatgugga taagcngatr. aatggttttg caactgtaat agtagctgta ccgttctaat 1920 

gcagacattg gatttggtga ctgtctcatu gcgccatgaq gtaaatgtaa tgtttcaggc 1980' 

attctgcttg caaaaaaatc tarcatgtgc ttttctagat gtctctggtt ctatagtgca 2040 

aatgct-tta ctagccaata ggaattttaa aataacangg aacttacaca ' aaaggcc ttt 2100 

catgtgcctt acttttttaa aaaugagcct accgtat-ca ccggaatotg cgacgtaagc 2160 

aataaaggga atgtcagacg tccaaaaaaa aaaaaaaaaa aaaaaaaaaa aaag 2214 
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<210> 114 
<211> 3300 

<212> DNA • ' . . . .. . . 

<213> Homo sapiens 

<400> 114 . ■ • * 
tcgacccacg cgtccggtga gaggcaagga cttctcatct ttagggcatt ctgaccacgt 60 



ectgcttcag ayagattgtt cccggcgtct cag-gctatg gggagcaggt tcctcctggt - 120 

cctgctctca ggtctcactg tctcactggc cctgccagga tcagaagcca agaattctgg 180 

agcttcctgt cctccatgcc ccaaatatgc cagctgccac aacagcaccc acrgtacttg 240 
tgaagatggc tttcgggcca ggtctggcag gacatacttt catgattcct ctgogaagtg ■ 300 

tgaagatatt aatgaatgtg aaaccgggct ggcaaagtgc aagtaLaaag catattgtag 360 

gaataaagtt ggaggttaca tc-gtagccg tttggtaaaa tatacttta- tcaactttct 420 

ggctggtatt atagattatg atcatccgga ttgttatgag aacaatagtc aagggacgac 480 
acagtcaaac gtggatattt gggtgagtgg ggtgaagcct ggatttggga aacagctggt ' 540 

acgtataact atgccatttc cctacccaaa cattaacatg tcttcccgtg atttttaggg 600 

tagggtagtt ctatccaggg gtoattttgt cctctgcccc aaggtcatct gtcaatgact , '660 

ggggacactt ttggttgtca taccttgggg gngatgtgtg cgactggcat ctggtggatg 720 

gagaccaggg atacagctca acatcctaca gcgcccagga cagcctccca caatcaagaa 7*30 

gtgcctagtg ccatatg.cc atiagagatag agaaatacaa gtgtaggggg aaagtgcctc *" .840 

agctggcatt caaagaccta catcaagcac ctacattctc aatgccacac gcacctngta 900 

gcaccttaat aaacatcgtt gcctLccggg ctccccactc , caacacctgt gcatattccc 960' 

tatctcrtac atttcagtaa gagtcaatct- aattagctta atttttttgg aaggaaaatc ; 1020 

tgagaagaaa tggaagcaga gaggactttg caagaagggc tactcaacta attcaaagcg 1080 
tggagttgag catctggaat gcgagttttg cctctccagg aaagggtcaa acttctgaat, 1140 

■ttgatatagt ctatgaaacc aagaggtgca atgagacaag ggagaatgct tttctggaog 1200 

ctggaaacaa caccatggat atcaactgty cLgatgcttt aaaaggaaac ctaagagaga 1260 

gcactgcagt tgccccatca cttatcaatc tcttggggat attctgaatg catccttttt 1320 

tagtaaacga aaagggatgc aggaagtaaa actgaactct tacgttgtga gcggcaccat 1380 . - 

cggtttgaag gaaaaaattt ccctctctga acctgtgttc ctgacttttc gccataatca 1440 

gcctggtgac aagagaacaa aacatatctg tgtctactgg gagggatcag agggaggccg 1500 • 

ctggtccacg gagggctgct ctcatgtgca tagcaacggt tcttacacca aatgcaagtg " 1560 

cttccatctg cccagcttcg ccgccctcgr ggctcttgcc cccaaggagg accctgtgct 1620 

gaccgtgatc acccaggtgg ggctgaccat ctctctgctg tgccccttcc, tggccatcct 1680, 

caccttcctc ctgcgccggc ccatccagaa c.accagcacc tccctccatc tagagctctc 1740 

cctctgcctc Ctcctggccc acctcctgtt cctgacgggc atcaacagaa ctgagcctga , 1800 < 

ggtgctgtgc tccatcattg cagggctgct gcacttcctc tacctggctt gcttcacctg * 1860 

gatgctcctg gaagggctgc accccttcct caccgtcagg aacctcaagg tggccaacta 1920 

caccagcacg ggcagattca agaagaggtt catgcaccct gtaggctacg ggatcccagc 1980 

tgtgattatc gctgtgtcag caatagttgg accccagaat tatggaacat ttactcactg 2040 

ttggctcaag cttgataaag gattcatctg gagcttcatg gggccagtag cagtcattat 2100 

cttga~aaac ctggtgttct acctccaagt tc tgtggatt ttgagaagca aaccctcctc '2160 
ccccaataaa gaagtttcca ccattcagga caccagagtc atgacattta aagccatttc ■ 2220 

tcagccattt atcctgggct gttcttgggg ccttggtttt tttatggttg aagaagtagg 2280 

gaagacgatt ggatcaatca ttgcatactc attcaccacc atcaacaccc ttcagggagt 2340 

gttgctcf.r.t gtggtacact gtciccttaa tcgccaggtt cgaatggaet ataaaaagtg 2400 

gtctagtggg atgcggaaag gggcagaaac tgaaagcact gagatgtctc gctctactac 2460 

ccaaaccaaa acggaagaag tggggaagtc czcagaaatc tttcataaag gaggcactgc 2520 

atcatcatct gcagagtcaa ccaagcaacc gcagccacag gttcatctcg tct.cngctgc 2580 

ttggctaaag atgaact.gac ctggcaagtg ccatggcaat gaeccggaag Ctaccccccc 2 540 

tttccgtttg tctacaccgc ccctgtggtc acacatagat cggacaaatg ccactatttc 27 CO 

tagctttcct gtgaaaagtc taggcccatt zacctatttt ggctccttat gttcatagaa 2760 

agaacaagac atttgggaga actcttagat ccagagtcca gtagcgtggc' acgtgcaatg 2820 

aagtgtcgga aggatgcatt Utaaagatgg cgggcgggag aagcggattt tcttctcgca 2 880 

gctactgcca ccttgccaga aactcactaa ctggcatctg gattcagctc a-ag-tccct 2940 

ttctggcctc tctgcrigtat ttcatgctcc Lcddyatctt acaccaacac cccacacrca 300U 

cataattcaa caacttccat atggatcagc attaaagagg gtgccgcatt ttgcaataca 3060 . 

aaaacgcatt atcaggtgct ggagaggacg tggagaaata ggaacacttt tacactgttg 3120 

gtgggactgc aaactagttc aaccatcgtg gaagrcagzg tggcgattcc tcagggatct 3180 



WO 00/06698 



PCT/US99/17130 



agaactdgaa ataccatttg acacagctat cccattaccg ggeatatacc ccaagggcta 3240 
taaatcatgc tgctataaeg acacatgcac acgtaaaaaa aaaaaaaaag ggcggccgct 3300 



<210> 115 
<2ll> 1286 . 

<212> DNA • ' - * 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1149) 

<222> n equals' a , t , g, or c.. 



<400> 115 , . .. 

. gactgcctta cggacattgg atgaagccga agcatctaga atggtgcctg gcac'acagtt 60 

ggtgcgtgat atggttaagc tttgtgtccc cacccacatc tcatcttgaa tgtgacggtt 120 

tccccggccc cctcctgccg cca tgtgaag aaggticgtr.g cttrc.cr.t.r.c accttrracc 180' ' 

accatgattg ccatggatgc tccccactcc aaagcagccc tggacagcat caacgagctg " 240 
cccgagaaca -cctgctgga gcLyLtcacg cacgtgcccg cccgccagct gccgctgaac ' '.300 

cgccgcctgg tctgcagcct_ ctggcgggca ccncatcgac ctcatgaccc tccggaaacg 360 
caacgcctgc gagagggctt catcaccaag gactgggacc agcccgtggc cgactggaaa." • 420 

atcttctatt tcccacggag cctgcatagg aac'tcctgcg caacccgtgt gctgaagagg 480 

atatgtttgc atggcaaatt gatttcaatg gtggggaccg ctggaaggtg gagagcctcc .540, 

ctggagccca cgggacagat: tctcctgacc ccaaagtceia gaagtatttt gtcacatcct ,600 

acgaaatgtg cctcaagccc cagc-ggtgg accttgtagc cgagggctac tgggaggagc 660 

tactagacac attccggccg gacaccgtgg ttaaggactg gtttgctgcc agagccgact 720 

gtggctgcac ctaccaactc aaagtgcagc tggcctcggc tgactacttc gtgttggcct 780 

ccttcgagcc cccacctgtg accatccaac agtggaacaa tgccacatgg acagaggtct 840 

cctacacctt ctcagactac ccccggggtg cccgctacat cctcttccag catgggggca 900 . 
gggacaccca gtactgggca ggctggtatg ggccccgact - caccaacagc. agcattgtcg ,960 

tcagccccaa gatgaccagg aaccaggcct cctccgaggc .tcagcctggg cagaagcatg 1020. 

. gacaggagga ggctgcccaa tcgccctacc ' gagctgttgt ccagattttc tgacagctgt . 1080 

ccatcctgtg tctgggtcag ccagaggttc ctccaggcag gagctgagca tggggtgggc 1140 

aagtgaggnc cctgtaccaa gcgacttctg cccugctt-ta .act-ctdccaei get tggggga 1200 

acttactgea cacagctctg acgttttgtt gtaataaatg ctttcaggcc gggcaaaaaa 1260 

aaaaaaaaaa . aaaaaaaaaa aaaaaa ? 1286 ■' 



<210> 116 

<211> ,2189 

<212> DNA 

<213> Homo sapiens 



<400> 116 . 
ggaatcegge acgaggcgcc cgaggatgtg ctgctggccg ctgctcctgc ■ tgtgggggct 60 

gctccccggg aeggeggegg gggectcgag ccgaacctat ccgcaccgga ccctcctgga 120 

cteggaggge aagtactggc tgggctggag ccagcggggc agecagateg ccttccgcct , 180 

ccacgrgcgc actgeagget acgtgggctz eggcttcteg cccaccgggg ccatggcgtc 240 

cgccgacatc gtcgtgggcg gggccgccca cgggcggccc cacctccagg attattttac ' 300 

aaatgcaaat agagagttga aaaaagatgc tcagcaagat taccatctag aatatgecat * 360 

ggaaaatagc acacacacaa caatcgaatc caccagagag ctgeatacat gtgacataaa * 420 

tgacaagagt ataaeggata gcactgtgag agtgatctgg gcctaccacc atgaagatgc 480 

aggagaagct ggtcccaagt accotgactc caataggggr ■ accadgagtt tgcggttatt ' • 540 
gaatcctgag aaaactagtc' tge-atctae agccttacca tactttgatc tggtaaatca' • 600 

cgacgtcccc accccaaaca aagacacaac atattggtgc caaatgtcta, agactcctgt. 660 ' 

cttccaagaa aagcatcatg caataaaggt tgagccagtg atacagagag gecatgagag * 72C . 

tctggt.gcac cacatcctgc tctatcagtg cagcaacaac ttcaacgaca gcgttcctgg 780 

aatcegggea cgaattccca tcaccccaac atgeccgaly tdLtcctcac ctgtgaaact 940 < 
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gtgacttttg cctgggctat tggtggagag ggcttttctt atccacctca tgctggatta 900 

tcccttggca ctccattaga tccgcattat gtgctcctag aagtccatto tgataatccq 960 

acttatgagg aaggcttaat agataattrt ggactgaggt tattttacac aatggatata 1020 

ayyaaalaLg aUgcLggyyt yaLLyaygcc ygcctctgyg LgagcctcLL ccataccacc 1030. 

cctccaggga tgcctgagtt ccagtctgag ggtcactgca ctttggagtg cctggaagag 1140 

gctczqqaaq ccgaaaagcc aagtggaatn catgtatttg ctqttcttct ccatgctcac 1200 

cccgctggca gaggcatcag gcf-gcgtcat tttcgaaaag ggaaggaaat gaaattactt 1260 

gcctatgatg atgattttga cttcaotttc caggagtttc agtatctaaa ggaagaacaa 1320. 

acaarcccac caggagataa cctaactact gagzgtcgct acaacacgaa agacagagct 13 80 

, gaga-gactt ggggaggact aagcaccagg agtgaaatgt gtctctcata ccttctttat 1440 

tacccaagaa LLaatctLac Lcgatgtgca agtattccag acaLtaugga acaact.r.nag 1500 

ttcaitgggg ttaaggagat* ctacagacca gtcacgacct ggcctttcat tatcaaaagt 1560 

cccaagcaat ataaaaacct ttcttzcatg gatgctatga aLaagtztaa atggactaaa i620 

aaggaaggtc tctccttcaa caagccggtc ctcagcctgc cagtgaatgt gagatgttcc ^ 1680 

aagacagaca atgctgagr.g gtcgactccn aggaatgaca gcattacctc cagatataga 1740 

aagaccctat aaagcagaac cctttggtgt gtggcacgtc ttcttcctct t-_-c-ctgcaca 1800 

gagatttctc ccatcaactt gcttgtttgc cttctgctac tcagctgcac gctgagcacc 1860 

aagagctcgt gatcaaaatt ctgtcggact tgacaatgtt ctccacgatc tgaacctgtc 1920 

an.LgaagLa caggttaaag actgtgtcca cttcgggcaL gaagagtgtg gagacttttc * 1980 

ttccccattt cccctccctc cttttccc-t tccatgctac atgagagaca tcaatcaggf. 2040 j 

tctcttctct ctcttagaaa tatctgatgt tatatacaca tggtcaataa aataaaactg 2IC0 

gcctgactta agataaccat tttaaaaaat tgggctgtca tgtgggaata aaagaaL Let 2160 

cc.cr.rtccta ooaaaaaaaa aaaaaaaaa ; . 2189 



<210> 117 

<-211> 1763 

<212> DMA 

<213> Homo sapiens 



<400> 117 

ggtggcctag agaegctget gccgcggtcg cagtcgtcgc gcacgcctct gcccgccagc 60 

ccgctccacc gccgtagcgc ccgagtgtcg ggygyegcac ccgagtcugg; ccatgaggcc 120 

gggaaccgcg ctacaggccg tgctgctggc cgtgctgctg gtggggctgc gggccgcgac 180 

gggtcgcctg ctgagtgggc agecagtctg ccggggaggg acacagaggc c"tgzta^aa 240 

agtcatttac ttccatgaza cttctcgaag actgaacttt. gaggaageca aagaagcctg 300 

caggagggga tggaggccag ccactcagca tcgagttctg aagatgaaca gaaactgata . 3 60 

gaaaakttca ttgaaaacct cttgccatct ' gatggtgayt. tctggawtgg gctcaggagg 420 

cgtgaggaga aacaaagcaa tagcacagcc gccaggacct ttatgcttgg actgatggca 480 

gcata-caca atttaggaac tggtatgtgg atgagccgtc ccgcggcagc gaegrctgeg 540 

tggtcatgta ccatcagcca tcggcacccg ctggcatcgg aggcccctac aigtzccagt 600 

ggaatgatga ccggtgcaac atgaagaaca atttcatttg caaatactct gatgagaaac 660 

, caqcaqttcc ttctagagaa gctgaagqtq aqgaaacaqa gctqacaaca cctq-acctc 720 

cagaagaaac acaggaagaa gatgecaaaa aaacatttaa agaaagtaga gaagctgect 780 

t.gnatctggc ctacatccta atccccagca ttccccttct cctcctcctt gkggrcacca 840 

cacntgea-g ttgggtttgg atctgtagaa aaagaaaacg ggagcageca gaccctagca 900 

caaagaagca acacaccatc tggccctctc ctcaccaggg aaacagcccg gacctagagg 960 

tctacaatgt catwagaaaa caaageqaaq ctgacttagc tgagaccegg ccagacctga 1020 

,agaatatttc actccgagtg egctegggag aagccactcc cgatgacacg tcttgtgact 1080 

atgacaacat ggctgtgaac ccatcagaaa gtgggtttgt gactctggtg agegeggaga 1140 

gtggatttgt gaccaatgac atttatgagt tctccccaga ccaaatgggg aggagtaagg 1200 

agtctggatg ggtggaaaat gaaatatatg gct.attagga cataraaaaa actgaaactg s 1260 

acaacaatgg aaaagaaatg ataagcaaaa tcctcttatt ttctataagg aaaatacaca 1320 

gaaggtctat gaacaagctt agatcaggtc ctgtggatga gcatgtggtc cccacgacct:* 13 80 

cctgttggac ccccacgttt tggctgeate crttatccca gccagtcatc cagctcgacc 1440 

ttatgagaag gtaccttgcc caggtctggc acatagtaga gtctcaataa atgtcacttg 1500 

gf.t-.ggttgta tctaactttt aagggacaga get tr.ac ct-.g gcagtgataa agatgggctg 1560 

tggagcttgg aaaaccacct ctgfctttcct tgetctatae agcagcacat attatcatac 1620 

agacagaaaa tccagaatct ttzcaaagee cacaratggt agcacaggtt ggcctgtgca 1680 
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tcggcaattc tcatacctgt ttttttcaaa gaiataaaatc aaataaagag caggaaaaaa 1740 
aaaaaaaaaa aaaaaaaccc gag 1763 

<2lb> 118 
<211> 1375 
<212> DMA 

<213> Homo sap i sr.s 



<220> 

<221> SITE 

<222> (18> 

<223> n ecfuals a t t,g, or c 



<400> 118 

. ggcgggcggc gcggaagngg cggckgcgcg gccggggcag ccatgtcgcc attgtctgcg 60 

gcgcgggcgg ccctgcgggt ctacgcggta ggcgccgcgg tgatcctggc gcagcrgctg 120 

cggcgccgcc gsgggggctt cctggagcca gctycccccc cacgacctga ccgtgrcgct 180 

atagtgacgg gagggacaga tggcattggc tattctacag cgaacatctg gcgagacttg , 240 

gcatgcatgt tatcar.agrrr. ggaaataacg acagcaaagc caaaeaagtt gtaagcaaaa 300 

taaaagaaga aaccttgaac 'gacaaagtgg aatttttata ctgtgacetg gettccatga 360 

cttccatccg gcagtttgtg cagaagttca agatgaagaa gartcctctc catgtcctga 420 

tcaacaatgc tggggtgacg atggtccctc agaggaaaac cagagatgga ttcgaagaac 480 

atttcggcct gaactaccta gggcacttcc tgctgaccaa ccttctctcg gatacgctga 540 

aagagtctgg gtcccctggc cacagtgcga gggtggtcac cgtctcctct gccacccatt 600 

acgtcgctga gccgaacatig gatgaccttc agagcagtgc ctgctactca ccccacgcag 660 

cctacgccca gogcaagccg ' gcccLtgtcc tgutcaccta ccacccccag cggctgctgg . 720 

cggccgaggg aagccacgcg accgccaacg tggtggaccc cggggtggtc aacacggacs 780 

tctacaagca cgtgttctgg gccacccgtc tggcgaagaa gcttctcggc tggttgcttt 840 

tcaagacccc cgatgaagga gcgtggactt ccatctacgc agcagtcacc ccagagctgg 900 

aaggagttgg tggccgttac ctatacaacg agaaagagac caagtccctc cacgtcacct 960 

acaaccagaa actgcagcag cagctgtggt ctaagagttg cgagatgact ggggcccctg 1020 

atgtgaccct gtgatatcct gtctcaggat agctgccgcc ccaagaaaca cactgcacct 1080 

gccaatagct. tgtgggtctg tgaagactgc ggtgtt-gag tttcccacac ccacctgccc ■ 1140 

acagqqctct gtcctctagr. r^.r.gagacag ctgcctcaac ctctgcagaa ctccaagaag 1200 

ccaaataaac attttggagg ataatcaccc caagtggcct tcaaccataa actctgtgat 1260 

cccaaagtgc ccagttgtca caggtgccat aaacaattac attttccaac ataaaaaaaa 1320 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaagggcg gccgc * 1375 



<210> 119 

<211> 1022 ' . . ; 

<212> DNA 

<213> Homo sapiens ' 1 

<220> 

<221> SITE ' '. . 

<222> (937) ' 

<223> n equals a.t.g. or c 

<220> . ' ■ 

<221> SITE " ' . 

<222> (990) ' ;. . f 

<223> n equals a,t,g, or c 

<40C> 119 

ggcacgagag agccrgcggc ccgagccggg gccatggcga actgctgagc cgcgtcctag 60 

gcccccggcc ctacaaaatc taccgggaga gggectctqa aagggccccg cccagcgccc 120 

ccgagacgcc aacggcagtc actgcccccc attccagcnc ctgggatacg cacraccagc 180 
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cccgtgccct ggagaaacat gctgacagca tcctggcact ggcttcagta ttctggtcca 240 

tctcttatta ctcctccccc ttcgccttct tctacttgta caggaaaggt tacttgagtt 300 

tgtccaaagt ggtgccgttc tctcactatg ctgggacatt gctgctacct ctggcaggtg 360 

tggcccgctc cgaggcattg gccgctggac caacccccag .taccggcagt tcatcaccat 420 

cttggaagca acacatcgga accagtcttc agaaaacaag aggcagcttg ccaactacaa 4B0 

ctttgacttr cggagctegc cagtcgactt ccactcggaa gaacccagca gccggaagga 540 

gtctcgaggg ggcccttccc gccggggtgt ggccctgctt cgcccagagc ccctgcaccg 600 

ggggacagca gacacccrcc ccaaccgggt taagaagctg ccttgtcaga tcaccagcta 660 

cctggtggcg cacaccctag ggcgccggat gctgtatcca ggctctgtgt acctgctgca 720 

gaaggccctc atgcctgtgc tgctgcaggg ccaggcccga ctggtggaag agtgtaatgg 7 80 

gcgccgggca aagctgctgg cctgtgatgg caatgagatt gacaccatg-t ttgtggaccg 840 

gcgggggaca gctgagcccc aggacagaag*. ctggigatct gctgcgargg gaatgctggg 90O 

ttttatgarg tgggctgcgt ctccamgccc ctggaanctg gatattcatc ckgggtggaa 960 

tcatccagct ttgctggaac acagggggtn ccattcccgc aaaatgaagg ctaatgccat. 1020 

gg 1022 



<210> 120, 

<211> 2311 ' • • 

<212> DNA ; ' " * 
<213> Homo sapiens 

<220> 

<221> SITE " ' 
<222> (654) 

<223> n equals a.t.g, or c 

<220> ' . - • • _ 
<221> SITE 

<222> (22931 , 
<223> n equals a, t;g, or c 

<400> 120 . * 

tcgacccacg cgtccgcttc ctcatgctgc ccgctcgcct accgttcagg c t get gage c 60 

ttttccctcg tggatccgct cccacggcag r.gr.gr.r.ar.gg cctccgggag ccgctcctgg 120 

agaggaggtg cgctgccgcc tcctccttcc agcactcatc gactct.ggga cgcgagcttc 180 

cttatgaccc cgtggacacg gagggctttg gagaaggtgg tgacacgcag gagcgttttc ' 240 

tgttcccgga gtacatcctg gatccggagc cgcaacccac ccgcgaaaag cagetgcagg 300 

agctccagca acagcaggag gaggaggagc gacagaggca gcagcggcgg gaggagcggc 360 

gacagcaaaa cctacgggcc aggtcccggg agcacccggt cgtggggcac ccggacccgg 420 

cattgccgcc cagcggcgtg aactgctcgg gctgcggggc asasctgcac tgccaggacg 480 

ccggagtgcc cggctocctg ccccgagaga agtCcctccg cacggcggag gcagacggcg 540 

ggctggcacg gaccgcgtgc cagcgctgct ggctgctgtc gcaccaccgg cgcgctctac 600 

gcctgcaggt gagccgcgag cagtacctgg agctggtgac cgccgcgttg cggnggcccg 660 

gcccctbcct ggtgetctar atggtggacc tgctggacct gcccgacgcc ctgctgcccg 720 

acttgcccgc gctggtgggc cccaagcagc tgatcgtgct gggaaacaaa gtggacctcc 7B0 

tgccccagga tgctcctggc f.accggcaga ggctgcggga gcgactgtgg gaggactgtg 840 

cccgergccgg gctcctgctg gcccctggca ccaagggcca cagcgccccg tcaaggacga 900 

• gccacaggac ggggagaa-c cgaatccgcc gaactggccc cgcacagtgg tcagggacgt 960 

gcggctgatc agcgccaaga ccggctatgg agtggaagag ttgatctctg cccttcagcg 102 0 

ctcctggcgc taccgtgggg acgtctactt: agcgggcgcc accaacgccg gcaaatccac 1080 

tctctttaac acgctccrgg agcccgatta ccgcaccgcc aagggctccg aggccatcga 1140 

cagagccacc atctcccctt ggccaggtac tacattaaac cttctgaagt' ttcctacttg' 1200 

caacccaact ccttacagaa tgtttaaaag gcatcaaaga taaaaaag ottcaactca 1260 

agctgaagaa gatctcagtg agcaagaaca aaatcagctt aatgtcctca aaaagcatgg 1320 

ttatgtcgta ggaagagttg gaaggacaLL cLtgta ixca gatigaacayct aggacaacar 1380 

tccctttgag tttgatgctg attcacttgc ctttgacatg gaaaa'cgacc ctgitatggg 1440 

cacacacaaa tccaccaaac aagtagaatt gactgcacaa gatgtgaaac atgcccactg 150C 

gttttaigac acccctggaa ttacaaaaga aaat.r.gt.att rtaaat.cttc taacagaaaa 1560 
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agaagcaaat. attgttttgc caacacagtc cattgttcea agaacttttg tgcttaaacc 
aggaatggtt ctgtttttgg gtgctatagg ccgcatagat ctcctgcagg gaaatcagtc 
.agcttggttt acagtcgtgg cctccaacat cctccctgtg catatcacct cctuggacag 
ggcagacgct ctgtaccaga agcatgcagg tcatacgt^a ctccagattc caatgggtgg 
aaaagaacga atggcrggat ttcctcctct tgttgccgaa gacatLatgt taaaagaagg 
actgggggca cctgaagcag tggccgacat caagtt ttcc tctgcaggtt gggtttcagt 
aacacctaat tttaaggaca gactgcatct ccgaggctat acacctgaag gaacagtttt 
gaccgtccgg ccccctctct t-g'ccatacat tgtcaacatc aaaggacagc gcatcaagaa 
aagtgtggcc- tataaaacca agaagcctcc ttcccttatg tacaacgtga ggaagaagaa 
aggaaagata aatgtatgag accgaccttg ttcactccag atattaactg tattgaacac 
aacaaaatac attgaatttg tattaaacat ataacgcata aataaagctc ccattcttac 
ccttaaaaaa aa'aaaaaaag ggcggccgct ctagaggatc caagctcacg tacgcgtgca 
tgcgacgtca tancrcgtct ataggaactg g 

<210> 121 
<211> 1286 

<212> DKA ' 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (1284) 

<223> n equals a,t,g, or c \ ' ' 



<400> ^21 

gggcgcgcgg gtgaaaggcg cat^gatgca gcctgcggcg gcctcggagc gcggcggagc 60 

agacgctgac . cacgttcctc tcctcggtct cctccgcccc cagctccgcg ctgcccggca 120 

gccgggagcc a-gcgacccc agggccccgc cgcctccccg cagcggctcc gcggcctcct 180 

gctgctcctg ctgctgcagc tqcccgcgcc gtcgagcgcc tctgagatcc ccaaggggaa 240 

gcaaaaggcg cactccggca gagggaggtg gtggacctgt acaatggaat gtccttacaa 300 

gggccagcag gagtgcctgg 'tcgagacggg agccctgggg ccaatggcat nccgggtaca 360 

cctgggatcc caggtcggga tggattcaaa ggagaaaagg gggaatgtct gacggaaagc 'V- 420 

tttgaggagc cctggacacc caactacaag cagtgttcat ggagttcatt gaattatggc 480 

atagatcttg ggaaaattgc ggagtgtaca tttacaaaga cgcgttcaaa tagtgctcta 540 

agagttttgt tcagtggctc acttcggcta aaatgcagaa atgcatgctg ccagcgttgg 600 

tatttcacat tcaatggagc tgaatgttca ggacctcttc ccattgaagc tataatttat 660 

ttggaccaag gaogccctga aatgaattca acaatcaata Ltcatcgcac ttcctctgtg 720 

gaaggacctt gtgaaggaat tggtgctgga t-agtggatg ttgctacctg ggttggcact 780 

cgttcagatt acccaaaagg agatgcttct actggatgga attcagtttc tcgcatcatt 840 

attgaagaac taccaaaata aatgctttaa tcttcatttg ctacctcttt tcctattatg 900, 

ccttggaatg gttcacttaa atgacatttt aaataagttt atgtatacat ctgaatgaoa 960 
agcaaagcta aatacgttta cagaccaaag tgtgatttca cactgttttt aaatctagca ' 1020 

ttattcatct tgcttcaatc aaaagcggtt tcaatatttt ttttagttgg ttagaatact ' 1080 

ttcttcatag tcacattctc tcaacctata atttggaata ttgttgtggt cttttgtttt 1140 

. ttctcttagt atagcatttt taaaaaaata Laaaagctac caaccttcgt acaatttgta 1200 

aatgttaaga attttttcta cacctgttaa ataaaaatca zttccaacaa aaaaaaaaaa 1260 

aaaaaaaaaa aaaaaaaaaa aaanaa • 1286 



<210> 122 
<211> 1380 
<212> DNA 

<213> Homo sapiens 



<400> 122 

cagacccgcg gggcaaacgg actggggcca agacccggga gcgcgggcgc aaaggcacca 60 

gggcccgccc agggcgccgc gcacacggcc ttgggggttc tgcgggcctt cgggtgcgcg 120 

cctcgcctct agccacgggg tccgcagcct tggagatcct gggcctggtg ctgtgccr.gg 180 



1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2311 
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-gggctgggg 
tggaccacaa 
tgcagagcac 
ggtgcaggcg 
cgcgaccctg 
tgcggccctc 
ctgctggttc 
gaagtacgag 
ggcggctgcc 
cccgtgaagt 
tacgtccgag 
ggataagccz 
cccctcggaa 
ccgcagctga 
tccttttctg 
cttccccicc 
act tcggagg 
ttactccatc 
cacuecagca 
gggaaggcta 



gggnctga tc 

catcgtgacg 

gggcacatgc 

gcgcgggcgc 

gcgggcgcgc 

acgggaggcg 

gccaacactg 

ctgggcgcac 

tcttgtgctg 

actcagcgcc 

ggcgctgggc 

ggggagcccc 

cgtccggctc 

ccttctcctg 

tgcgcggcgc 

aagacgctgg 

?gcggatgca 

ggcagggccc 

tctccccagg ' 

agaatccgct 



ctggcgtgcg 

gcgcagacca 

agtgcaaagt 

tcaccgtgag 

agtgcaccac 

tgctctacct-. 

tcgtccgcga 

gct.gtacatc 

cggcgcctgg 

gcggcggccc 

acggccgggc 

gcatggaccg 

tgcgccccga 

ccactagccc 

tgtttccata 

gggrcttggc 

gagcccaggg 

gagcccaggg , 

caaggcttgt 

tagtaaatgg 



ggctgcccat 

cctggaaggg 

gtacgactcg 

cgccgtgctg 

ctgcgtggcc 

gtt.ttgcggg 

gttttacgac 

ggc cgggcgg 

gtctgcaccg 

acggccaccg 

ccctcctgcc 

cggcttccgc 

cgcggccccc . 

ggccctgccc 

ggcagagcgg 

tgctgcctta 

cccccaccgg 

accagcgact 

gggcaccgga 

tttgaactct 



gtggcaggcg 
gctgtggatg 
gtgctggccc 
ctggcgtCcg 
ccgggcccgg 
ctgctggcgc 
ccguctgtgc 
ccaccgcgcc 
gccgtcccga 
gcgactacga 
agccaegect 
cgggtagcgc 
ggatccgctc 
ttaacagacg 
gtgtcagact 
cttcccagag 
aagatgtgta 
tggcctggac 
gcttgagaga 
caaaaaaaaa 



accgccttcc . 

Lcgtgcgtgg" 

tgagcaccga 

ttgcgctctt 

ccaaggcgcg 

tcgtgccact 

ccgcgccgca 

gcccacggta 

cctcagcttc 

caagaagaac >, 

gcgaggcgtt 

ggcgcgcagg 

ctgcctgcgc 

gaatgaagtt 

gaggatttcg 

gctcctgctg 

cacctggtct 

ctcccggtct 

gggcgggagt 

aaaaaaaaaa 



24C 
300 
360^ 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320. 
13 80 



<210> 123 
<211> 3793 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE ■ ■ : - 

<222> (1102) • ' . 

<223> n equals a,t,g, or c 

<220> t 
<221> SITE 
<222> (1132) . 

<223> n equals a , t,g, or c 

<220> . 
<221> SITE 
<222> (1199) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1228) 

<223> n equals a.t^g., or c 



<220> 

<221> SITS 
<222> (1229) 
<223> n equals 



a,t,g, or c * 



<220> 

<221> SITE 
<222> (1231) 

<223> n equals a,t,g. or c 



<220> 

<221> SITE 
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<222> (3176) 

<223> n equals a, eg, or c 



<400> 123 

ggggcttcat acaggaaatc tattgctgtg tcaagttcca gagaaaagct tctgttcgtc 60 

caagttacca accaggctaa accacar.aga cgtgaaggaa ggggctagaa ggaagggagt - 120 
gccccactgt tgatggggta agaggatcct gtactgagaa gttgaccaga gagggtctca ' 180 

ccatgcgcac agttccttct gtaccagtgt ggaggaaaag tactgagtga agggcagaaa 240 

aagagaaaac agaaatgetc tgcccttgga gaactgctaa cctagggcta ctgttgattt 300 

tgactatctc cttagtggcc gaageggagg gtgetgetea accaaacaac teattaatge 360 

tgcaaactag caaggagaat catgetttag cctcaagcag tttatgratg gatgaaaaac 420 

agattacaca gaactacccg aaagtactcg cagaagttaa cacttcatgg cctgtaaaga 480 

tggctacaaa tgctgtgctt tgtcgccccc "ccatcgcatt aagaaatttg atcataataa 540 

catgggaaar aatcctgaga ggccagcctr cctgcacaaa agcccacaag aaagaaacaa 600 

atgagaccaa ggaaaccaac tgtactgatg agagaataac ctgggtctcc agacc-gatc 660 

agaattegga ccttcagatt cgtaccgtgg ccatcactca tgaegggtat tacagatgea 720 

taatggtaac acctgatggg aatttccatc .gtggatatca cctccaagtg ttagttacac 780 

ctgaagtgac cctgcttcaa aacaggaata gaactgeage atgeaaggea gttgcaggga 840 

agccagctgc gcatatct-c • tggatcccag agggegattg tgccactaag caagaatact * 900 

ggagcaatgg cacagtcract gttaagagta catgccactg ggaggtccac aatgtgccta ' 960 

ccgtgaactg ccacgtctcc catttgactg gcaacaagag tctg-acata gagctacttc 1020 

ctgttccagg tgccaaaaaa tcatcaaaat tatatattcc atatatcatc c'ttactatta 1080 

ttatttcgac catcgtggga tncatccggt cgttgaaagc caatggctgc anaaaatata . 1140 

aattgaataa accagaatct actccagttg ttgaggagga egaaatgeag^ ccctatgcnt * 1200' 

tttacacaga gaagaacaat cccctctnng ntactacaaa caaggtgaag gcatctgagg 1260 

cattacaaag tgaag.tgac acagacctcc atactt.ata agttgttgga ctctagtacc 1320 

aagaaacaac aacaaacgag atacattata ottactgtcc. gatcttctta cagttctaga 1380 

atgaogactt atattgaaat taggtzttcc aaggttccta gaagacattc taatggattc 1440 

tcattcacac ccttgtataa ctggaatttt tgattcttag ctgctaccag ctagttccct 1500 

gaagaactga tgttattaca aagaaaatac atgcccatga ccaaatattc aaattgtgca 1560 

ggacagtaaa taatgaaaac caaatttcct caagaaataa ctgaagaagg agcaagtgtg 1620 

aacagtttct tgtgtaccct ttcagaatat tttaatgtac atatgacatg tgtatatgee 1680 

tatggtatat gtgecaautt atgtgtcccc ttacatatacr catgcaccct acctt-gtca 1740 

aggcaccagt. gggaacaata cactgeatta. ctgttctata catatgaaaa cctaataata 1800 

taagtctrag agatcatttt atatcatgac aagtagagct acctcattct ttttaatggt' 1860 

tatataaaat tccattgtat , agttatatca ttatttaatt aaaaacaacc ctaargatgg 1920 * 

atatttagat tcttttaagt tttgttt&tt zcttttaagt tttgtttgtg gtataaacaa 1980 

, taccacatag aatgtttct^ gtgeacatae ctcttcgttt tcgagtatac ctgtaggata • 2040 

actt-cttga gtggaattgt caggtcaaag ggtttgtgca tztttactatc gatatatatg 2100 

ttaaattgtg tcaaatatat atgtcaaatt ccctccaaca ttgtttaaa?: gtgcctttcc • 2160 

ctaaatttct attttaataa ctgtactatr cctgcttcta cagttgccac tttctctttt 2220 

taatcaacca gattaaatat gatgegagat tataataoga attatactat ttaataaaaa ■ 22B0 

tggatttata tttctggtca tgtttgtaag agagtgaatg caegtgegag aacatcagct 2340 

tcttctgaac tcattatatc tccacagagg tgttgatact tgatgcctaa cagttttgea -2400 

gatgtgctac attggaattg tg-attttr.a tggtgtacat tct'attgtga tatanttatt 2460 

gaataattaa t-.gtctattga ccatataagt ggcgaaaaat gcaccataga ggacatgggg 2520 

tatttattta caaactatga gctacataat aagcaagtgg ccatgggatg gcatgaccct 2580 

, cccctccata Uttccgtgga gcaaaatatt ggcaatgttt atgtaaatca ttgttiaatat 2640 

catgaaatta ttttraacta aaaacacaag tetatttget ccatagcaga aaaaacatga '2700 

gaagcttttt catcatgaca gaaattgaaa caaaatatat tcattcttca atcataccat 2760 

etg»gatt-t caagacagct attttgtctt ataagtatat ttttctccct ctagacattt 2820 

cagttactat ggarcttgtc ctcaciagygci cyLUcagtct aatLLcggga tgtaaagcta 2880 

atcttaatga cacttggcac atgacatcct gatcaageca ttttgacttg accaaaaagc- 2940 

agcgtccatc aggtttc^gc acataaatat taccaagcaa tgttcacaat agacatcatt 3000 

acactgtccc tgaaatctat taattcttca tccaaccctg gttgagctga ggctcatagt 3060 - 

taggttcaag actatcegtt taaatattar tgnanaacaa ogenngacog tactntget" , 3120 

acctcttaac ttgataatgt caaaccaggc atgttaaatg acatcataga aaaganttca 3180 

agacaattta tagaagttaa attatattgt acagaaaata attgeatgaa aatctccact 3240 

atggggctgg aacatggttg aacattagaa r.gatataaaa aattatatat atr.ctccaaa' 3300 
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tccacgctag acctgtcaaa tnagagaatc' tagagattag acctggcgtg tcagcaaggt .3360 

cacccaggaa gcagaggctg agacggagtt aggtgtgatt acttacatag tcgatfcacat 3420 

tttacaaata aca~tttata tgtctcattt actgtgcttt ctccccatcc catttcgtat . 3480 

v cttttccttt gctttgctag atttgtcaat cttctctctc tttctgtctc tctctctttc 3540 

aatatctcta ataatttgaa agtaattcac cataactaaa tatctattgg ggttatgctt . 3600 

cacctacaaa cttctqaaaa cggctttact gagatataat r.gatatattt aagtgtacag 3660 

tttgttaaat tttgcacata tttaaaatgt ggactttggt aaatgttcac atagt-ttac 3720 
atctgtgaaa ccatcagcat aatcaagata ataaacttgt ccatcacccc ccaaaaaaaa 
oaaaaaaaaa aaa 



<210> 124 

<211> 370 " ' ' . 

<212> PRT , ■ ■ 

<213>, Homo sapiens ' 

"<220> . 
<221> SITE - * 

<222> (370) ' _ • ■■ 

<223> Xaa equals scop translation 

■<430> 124 

Met Leu Gly Ala Phe Val Trp Pro Ser Leu Leu Leu Leu Ala Ala Ala 
' V 5 ■ ■ 10 15 

Cys He Cys Leu Leu Thr Phe lie Asn Cys Ala Tyr Val Lys Trp Gly 
20 • 25 30 

Thr Leu Val Gin' Asp He Phe* Thr Tyr Ala Lys Val Leu Ala, Leu He 
35 40 "45 

Ala Val lie Val Ala Gly He Val Arq Leu Gly Gin Gly Ala Ser Thr 
50 ' ' .55 fiO 

His Phe Glu Ash: Ser Phe Glu Gly Ser Ser Phe Ala Val Gly Asp lie 
65 70 75 80 

Ala Leu Ala Leu Tyr ser Ala Leu Phe Ser Tyr Ser Gly Trp Asp - Thr 
85 90 95 

Leu Asn Tyr Val Thr Glu Glu lie Lys Asn Pro Glu Arg Asn Leu Pro 

. 100 * 105 ' no' 

Leu Ser lie Gly He Ser Met Pro lie Val Thr Ile ; lie Tyr lie Leu 
115 ' ' '120 125 

Thr Asn Val Ala Tyr. Tyr Thr Val Leu Asp Met Arg Asp He Lou Ala 



130 



135 ' ■ 140 



Ser Asp. Ala Val Ala Val Thr Phe Ala Asp Gin He Phe Gl y Tie Phe 
145 . • 15Q 155 16C 

Asn Trp He He Pro Leu Ser Val Ala Leu Scr Cys Phe Gly Gly Leu 
165 170 . 175 

Asn Ala Ser Tie Val Ala Ala Scr Arg Leu Phe Phe Val Gly Ser Arg 
180 . .' 185 190 ■ 

Glu 'Gly His Leu Pro Asp Ala lie Cys Met He His Val Glu Arg Phe 



3780 
3793 



WO 00/06698 



85 



PCT/US99/17130 



195 200 

. Thr Pro Val Pro Ser Leu Leu Phe 
210 ■ • . 215 

Leu Cys Val Glu Asp lip Phe Gin 
225 '23 0 * 

Tyr Trp Phe Phe Val Gly Leu Ser 
245 

Trp Lys Glu Pro Asp Arg Pro Arg 
260 

Pro lie Val Phe Cys Leu Cys Thr 

275 " '. ' 2B0 



205 ■ 

Asn Gly He Met Ala Leu He Tyr 
220 

Leu He Asn Tyr Tyr Ser Phe Ser 
- 235 240 

He Val Gly Gin Leu Tyr Leu Arg 
250 255 

Pro Lei Lys Leu Ser Vel Phe Phe 
265 . 270 

He Phe Leu Val Ala Val Pro Leu 
28 5 • 



Tyr Ser Asp Thr He Asn Ser Leu He Gly He Ala He Ala Leu Ser •» 
■290 295 300 

Gly Leu Pro Phe Tyr 'Phe Leu He He Arg Val Pro Glu His Lys Arg 
305 310 315 t 320 

Pro Leu Tyr Leu Arg Arg Ser Trp Gly Leu Pro Gin Glv Thr Ser Arg 

325' - 330 • .335 

Ser Cys Val Cys Gin Leu Leu Gin Lys Trp He Trp Lys Met Glu Glu. 
340 . 345 350' 

Arg Cys Pro Ser Asn Gly He Pro Ser Leu Thr Lys His, His Leu Glu 
355 360 365 

Ser Xaa 

370' 



<210> 125 • 
<211> 86 
<212> PRT 

<213> Homo, sapiens 
<220> 

<221> SITE 
<222> (86) 

<223> Xaa equals stop translation 
<400> 125 

Met Gly Phe Trp Cys Gly Cys Pro Phe Cys Leu Leu Val Val Leu Leu 
1 ' 5 10 15 

Thr Asp Arg Thr Leu Scr Cys Arg Ser Vol Gly Veil Pro Cys Asn Val 

20 - 25 30 

Arg Cys Gin Cys Ala , Pro Ala Gly Gly Cys Leu Pro Val Arg Leu Leu 
35 40 45 ' 

Ala Gly Gin Gly Ser Gly Thr His Leu Arg Arg Gin Ser Ala Arg Ser 
50 =5 60 
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Gin, He Ser Ser Cys Met Leu Gly G]u Pro Leu Leu Ser Ser Lys Leu 
65 70 75 80 

Ser Asp Arg Asp lie Xaa 
85 



<21Q> 126' . . 

*:2ll> 44 
<212> PRT 

<213> Homo sapiens *' 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals. stop, translation 
, <400> 126 

Met Tyr Thr Lys Thr His Lys Phe Lys Phe Tyr Asn Phe Leu Ser Leu 
1 5 10 ' 15 

Trp He Trp Lys lie Phe Phe Leu Leu Phe Phe He Leu lie Val Ala' 
20 ' 25 30 

Leu Ala Phe Pro He Pro Cys Leu Ser lie Phe Xaa 
35 40 



<210> 127 . < 
<211> 319 

<212> PRT ' 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (264) 

<223> Xaa equals any of the naturally occurring L-amino acids . 

<220> " 

<221> SITE 

<222> (303) ■ ■ . 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 127 

Ket Asn Thr Asp His Leu Arg Leu. Thr Val Pro Asn Gly He Gly Ala 
1 , 5 10 15 

Leu Lys Leu Arg Glu Met Clu His Tyr Phe Ser Gin Gly Leu Ser Val 

-.20 , 25 30 

Gin Leu Phe Asn Asp Gly Ser Lys Cly Lys Leu Asn His Leu Cys Gly 
35 , 40 45 

Ala : Asp Phe Val Lys Ser. His Gln : : Lys Pro Pro Gin Gly Met Glu He. 
50 55 60 

Lys, Ser Asn Glu Arg Cys Cys Ser Phe Asp Gly Asp Ala Asp Arg He 
.65 7 0- 75 8C 
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Val Tyr Tyr Tyr His Asp Ala Asp Gly His Phe His Leu Tie Asp Gly 
85 " 90. 95 ■ 

Asp Lys lie Ala Thr Leu lie Ser Ser Phe Leu Lys Glu Leu Leu Val 
100 105 110 

Glu lie Gly Glu Ser Leu' Asn lie Gly Val Val Gin Thr Ala Tyr Ala 
• 115 . 120 125 

.Asn Gly Ser Ser Thr* Arg Tyr Leu Glu Glu Val Met Lys Val Pro Val 
: 130, 135 ,140 

Tyr Cys Thr Lys Thr Gly Val Lys His Leu His His Lys Ala Gin Glu 
145 150 ; , 155 ' 160' 

Phe Asp lie Gly Val Tyr Phe Glu Ala Asn Gly His .Gly Thr Ala Leu ; 

165 i 170 175 

Phe Ser Thr Ala Val Glu Met Lys lie Lys Gin Ser Ala Glu Gin Leu 
180 IBS 190 

Glu Asp Lys Lys Arg Lys^ Ala Ala Lys Met Leu Glu Asn lie lie Asp 
195 • 200 205 

Leu Phe Asn, Gin Ala Ala Gly Asp Ala lie Ser Asp Met Leu' Val lie" 
210 215 -'. 220 

Glu Ala lie Leu Ala* Leu Lys Gly Leu Thr- Val Gin Gin Trp Asp Ala 
22 5 230 *. 235 240 

Leu Tyr Thr Asp Leu Pro Asn Arg Gin Leu Lys Val Gin Val Ala Asp 
24 5 250 255 

Arg Arg Val He Ser Thr Thr Xaa Ala Glu Arg Gin Ala Val Thr Pro 
.260 265 '270 

Pro Gly Leu Cln.Glu Ala lie Asn Asp Leu Val Lys Lys Tyr Lys Leu 
275' 280 285 

Ser Arg Ala Phe Val Arg Fro Ser Gly Thr Glu Asp Val Val Xaa Ser 
290 * 295 ' 300 

lie Cys , Arg Ser Arg Leu Thr Arg Lys -Cys Arg Ser. Pro Cys Thr 4 
305 ; '310 315 



<210> 128 

<211> 46 , 
<212> FRT 
<213>'Homo sapiens 

<220> ; ' 
<221> SITE : 

<222> (46) ^ 
<223> Xaa equals stop translation 

<400> 12 8 , . . 

Me-: Asp Ket val Cys Phe Cys He Tyr Leu Gly Leu Leu Lys Phe He 
' 1 5 10 -15 
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Phe Cys Ser Phe Ser Glu Glu Vail " Leu Tyr lie Ser Phe 
20 - 25 30 

lie Pro Lys Tyr' Phe Val Glu Met Leu Leu Xaa 

■ -'40 . '• 45 • 

v<:210> 129 \ ' , , ' > 

<211> 709 , ■ ., • 

<212> PRT ' '' . , ■ 

<213> Homo sapiens 

<220> . • ; 

<221> SITE . . . 

<222> (189) ' ' 

<223> Xaa equals any of the. naturally occurring L-amino acids ' 

<220> . ■ " \ 

<221>' SITE \ , 

<222> (275) , ■ * . 

<223> Xaa equals any of Che naturally occurriny L -amino, acids 

<220> ■ • • ■■ .. .. 

<221> SITE ( ' . . 

<222> (414) " 

<223> Xaa equals any of the naturally occurring L^aioiiiu- acids . 

<220> ' 

<221> SITE 

<222> (438) , v 

<223> Xaa equals any of the naturally occurring L-ani no acids 

<220> • 
<221> SITE 

<222> (641) . f 

<22 3> Xaa equals any of the naturally occurring L-amino acids 

<220> _ 

<221> SITE ' , ' . 

<222> (643) * '., ". 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> ' " ' ' 

<221> SITE 

<222> (696) ' . '. ' ■ 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> ■ »•• ..' * 

<22i> SITE * ;> 

</22> (697) • , 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 129 

Met Ala Gly Leu'Asn Cys Gly Val Ser He Ala Leu Leu Gly Val' Leu 
1 5 . ' 10 ; 15 • 

Leu Leu Gly Ala Ala Arg Leu Pro Arg Gly Ala Glu' Ala Phe Glu He 
20 25 • ; 30 



Ser Ala lie 

Val Lys Cys 
35 
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Ala Leu Pro Arg Glu Ser Asn lie Thr Val'Leu lie Lys Leu Gly Thr 
35 40 45 

Pro Thr Leu Leu Ala Lys Pro Cys Tyr He Val He Ser Lys Arg His' 
50 55 ..-*■.. 60 

He Thr Met Leu Ser He Lys Ser Gly, Glu Arg He Val Phe Thr Phe 
65 70 75 • ' 80. 

Ser Cys, Gin Ser Pro Glu Asn His Phe Val He Glu lie Gin Lys Asn 

85 90 .. \ 95 •• 

He Asp Cys Met Ser Gly Pro Cys Pro Phe Gly. Glu Val Gin Leu Gin 
100 105 110 ■ , 

Pro Ser Thr. Ser Leu Leu Pro Thr Leu Asn Arg Thr Phe He Trp Asp 
115 ' 120 125 

Val Lys Ala His Lys Ser He Gly Leu Glu Leu Gin Phe Ser lie . Pro 
130 • 135 , 140 

Arg Leu Arg Glu lie' Gly Pro Gly Glu ser cys Pro Asp Gly Val Thr. . 
.145 150 155 160 

His Ser He Ser Gly Arg He Asp Ala Thr Val Val Arg lie Gly Thr 
165 170 175 I 

Phe Cys Ser Asn Gly Thr Val Ser Arg lie Lys Met Xaa Glu Gly Val 
180 185 190 

Lys Met Ala Leu 'His Leu Pro Trp Phe His Pro Arg Asn Val' Ser Gly 
195 ■ a 200 205 

.Phe Ser He Ala Asn Arq Ser Ser lie Lys Arg Leu Cys He He Glu 
210 215 220 ■ . 

Ser Val Phe Glu Gly Glu Gly Ser Ala Tnr Leu Met Ser Ala Asn Tyr 
225 230 235 ' 240 

Pro Glu Gly Phe Pro Glu Asp Glu Leu Met Thr Trp Gin *Phe Val Val 
245 250 255 ' 

Pro Ala His Leu Arg Ala Ser Val Ser Phe Leu Asn Phe Asn Leu Ser 

260 , 265 270 ■' • • 

Asn Cys Xaa Arg Lys Glu Glu Arg Val Glu Tyr Tyr He Pro Gly Ser 
275- 280 2Bb 

Thr Thr Asn Pro Glu Val Phe ' Lys . ( Leu Glu Asp Lys Gin Pro Gly Asn 
290 295 ' 3C0 

Met Ala Gly Asn Phe Asn Leu Ser Leu Gin Gly Cys Asp Gin Asp Ala 
. 3 °5 31C 315 320 

Gin Ser Pro Gly He Leu Arg Leu Gir. Phe Gin Val Leu Val Gin His 
325 330 •» 335 

Pro Gin Asn Glu Scr Asn Lys lie Tyr Val Val Asp Leu Ser Asn Glu 
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340 .345 350 

Arg Ala Met Ser Leu Thx He . Glu Pro Afg Pro Val Lys Gin Ser Arg 
355 360 ' 365 

Lys Phe Val Pro Gly Cys Phe Val Cys Leu Glu Ser Arg Thr Cys Ser 
370 375 380 ■• 

Ser Asn Leu Thr Leu Thx' Ser Gly ser Lys His Lys lie Sex Phe Leu 
395 390 '395 ' 400 

Cys Asp Asp Leu .Thr" Arg Leu Trp Met Asn Val Glu Lys Xaa lie Ser 

' fl • • 405 410 ' > l 415 

Cys Thr Asp His Arg Tyr. Cys Gin Arg Lys Ser Tyr Ser Leu Cln Val 
'■ . ■ 420 425 ■ 430 

Pro Ser Asp lie Leu Xaa- Leu Pro Val Glu Leu His 'Asp Phe Ser Trp 
435 440 ' •. • 445 

Lys Leu Leu Val Pro Lys Asp Arg Leu Ser Leu Val Leu Val Prs Ala 
. 450 \ ' 455 460 

Gin Lys Leu Gin Gin His. Thr His Glu Lys ' Pro Cys Asn Thr Ser Phe 
465 • . 470 47 5 430 

Ser Tyr Leu Val Ala Ser Ala He Pro Ser Gin Asp Leu Tyr Phe Gly 
• 485 s 490 495 

Ser Phe Cys Pro Gly Gly Ser He Lys Gin He Gin Val Lys Gin Asn 
500 505 • 5.10 

He Ser Val Thr Leu Arg Thr Phe Ala Pro Ser Phe Arg Gin Glu Ala 
515 . 520 525 

Ser Arg Gin Cly Leu. Thr Val Ser Phe He Pro Tyr Phe Lys Glu Glu 
530 . 535 540 

Gly val Phe Thr Val Thr Pro Asp Thr Lys Ser Lys Val Tyr Leu Arg 
545 550 555 560 

Thr Pro Asn Trp Asp Arg Gly Leu Pro Ser -Leu Thr Ser Val Ser Trp 
565 57C . 575 

Asn He ser Val Pro Arg Asp Gin Val Ala Cys Leu Thr Phe Phe Lys 
580 585 590 

Glu Arg Scr Gly Val Val Cys Gin Thr Gly Axy Ala Phe Met He lie 
595 500 635 

Gin Glu Gin Arg Thr Arg Ala Glu Glu He Phe Ser Leu Asp Glu Asp 
610 ' 615 620 

Val Leu Pro Lys, Pro Ser Phe His His His Ser Phe Trp Val Asn He 
625 630 635 640 

Xaa Asn Xaa Ser Pro Thr Ser Gly Lys Cln Leu Asp Leu Leu Phe Ser 
645 ' 650 ' * 655 
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Val Thr Leu Thr Pro Arg Thr Val Asp Leu Thr Val He Leu He Ala 

• t ' 660 ; . 665 670 

Ala Val Gly Gly Gly Val Leu Leu Leu Ser Ala Leu Gly Leu He He 
675 680 685 . ' 

Gys Cys Val Lys Lys Lys Lys Xaa Xaa Thr Arg Gly Pro Ala Val Gly 
690 695 700 ' ' 

He Tyr Ash Gly Asn »■ ■ 

705 



<210> 130 „ ' 

<211> 415 

<212> PRT ' * 

<213> Homo sapiens 

<220> ■ ■ ' '.' 

<22i> SITE 

<222> (415) ... 

<223> Xaa equals stop translation ■;* 

<400> 130 \ . 

Met Thr Lys Ala Arg Leu Phe Arg Leu Trp Leu Val Leu Gly Ser Val 

.1 . - 5 10. ^ • 15 

Phe Met He Leu Leu He He Val Tyr Trp Asp Ser Ala Gly Ala Ala 
20 .25 30 

His Phe Tyr Leu His Thr Ser Phe Ser Arg Pro His Thr Gly Pro Pro 
35 . 40 45 v 

Leu Pro' Thr Pro Gly Pro Asp Arg Asp Arg Glu Leu Thr Ala Asp Ser 
50 ,55 . 60 

Asp Val Asp Glu Phe Leu Asp Lys Phe Leu ser Ala Gly Val Lys Gin 
• 65 70 75 80. 

Ser Asp Leu- Pro Arg Lys' Glu Thr Glu Gin Pro Pro Ala Pro, Gly Ser 
85 90 95 

Met Glu, Glu Asn Val Arg Gly Tyr Asp Trp Ser Pro Arg Asp Ala Arg 
100 105 • 110 

Arg_Ser Pro Asp Gin Gly Arg Gin Gin Ala Glu Arg Arg Ser Val Leu 
115 12 0 12.5 

Arg Gly Phe Cys Ala Asn Ser Scr Leu Ala Phe Pro Thr Lys Glu Arg 
130 135 140 

Ala Phe Asp Asp He Pro Asn Ser Glu Leu Ser His Leu He Val Asp 
145 "-50 155 160 

Asp Arg Kis.Gly Ala lie Tyr Cys Tyr Val Pro Lys Val Ala Cys Thr 
165 170 175 

Asn Trp Lys Arg Val Met He Val Leu Ser Gly ser Leu Leu His Arg 
180 185 190 
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Gly Ala Pro Tyr Arg Asp Pro Leu Arg lie Pro Arg Glu His Val His 
195 200 •• 205 

Asn Ala Ser Ala His Leu Thr Phe Asn Lys Phe Trp Arg Arg 'Tyr Gly 
210 ' 215 220 

" Lyn Leu ser Arg His Leu Met Lys Val Lys Leu Lys Lys Tyr Thr Lys 
225 230 235 240 

Phe Leu Phe Val Arg Asp Pro. Phe Val Arg Leu He. Ser Ala Phe Arg 
. 245 250 s 255 

Ser Lys. Phe Glu Leu Glu Asn Glu Glu Phe Tyr Arg Lys Phe Ala Val ♦ 
260 265 270 

Pro Met Leu Arg Leu Tyr Ala Asn His Thx Ser Leu Fro- Ala Ser Ala 
'275 280 ' 285 

Arg Glu Ala Phe Arc Ala' Gly Leu Lys Val Ser Phs Ala Acn Phe lie 
29G 295 300 

Gin Tyr Leu Leu Asp Pro His Thr Glu Lys Leu Ala Pro Phe Asn Glu 
305 310 315 320 

His Trp Arg Gin Val Tyr Arg Leu Cys His Pro Cys Gin He Asp Tyr 
325 330 335 

Asp Phe Val Gly Lys Leu Giu Thr Leu Asp Glu Asp Ala Ala Gin Leu' 
340 345 350 

Leu Gin Leu . Leu Gin Val Asp Arg Gin Leu Arg Phe Pro Pro Ser Tyr 
.155 , 360 '365 

Arg Asn Arg Thr Ala Ser Ser Trp Glu Glu Asp Trp Phe Ala Lys He 
370 '375 380 

Pro Uu 'Ala 'Trp Arg Gin Gin Leu Tyr Lys Leu Tyr Glu Ala Asp Phe 

385 .' 390 ■ ' 395 ■ ■" 403 y 

Val Leu Phe Gly Tyr Pro Lys Pro Glu Asn Leu Leu Arg Asp Xaa 
4C5 410 415 



<210> 131 r . ■ ■ 

<211^ 242 
<212> P3T 

<213> Homo sapiens ' * 

• <400> 131 

Met Gin Leu Gly Ser Val Leu Leu Ihr Arg Cys Pro . Phe Trp Gly Cys 

1 ' 5 ' 10 . _ 15 ' 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 
20 25 30 

Pre Asp He Pro Val Pro Tyr Leu Tyr Phe Asp Met Giy Ala Ala Val 
35 40 45 
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Lsu Cys Ala Ser Phc Met 3er Phe Gly Val' Lys Arg Arg Trp Phe Ala 
. , 50 55 1 . 60 

Leu Gly Ala Ala' Leu Gin Leu Ala lie Ser Thr'fyr Ala Ala Tyr He 
65 70 75 • •80 

Gly Gly Tyr . Val Kis Tyr Gly. Asp Trp Leu Lys Val Arg Met Tyr -Ser 
85 90 95 

- Arg Thx Val Ala He lie Gly Gly Phfi Leu Val Leu Ala Ser Gly Ala 
100 105 • 110 

Gly Glu Leu tyr Arg Arg Lys Pro Arg Ser- Arg Ser Leu Gin Ser Thx 
115 120 , , ' 125 ' 

Gly Gin Val t Phe Leu Gly lie Tyr Leu He Cys Val Ala Tyr Ser Leu 

' ' 130 . '•' 135 ' 140 \ ' \ 

' Gin His Ser Lys Glu Asp Arq Leu .Ala Tyr Leu Asn His Leu Pro Gly 
145; 150 .' 155 , 160 ' 

Gly Glii Leu Met He Gin Leu Phe Phe Val Leu Tyr Gly He Leu Ala 

'- t ' ' 155 170 . .175- - 

Ley Ala Phe Leu Ser Gly Tyr Tyr Val Thr Leu Ala Ala Gin He Leu 
' • ' 180. 185 190 

Ala Val Leu Leu Pro Pro Val Met Leu Leu lie Asp'Gly Asn Val Ala 
195 ,200 205 

Tyr Trp His Asn Thr Arg Arg Val Glu Phe Trp Asn! Gin Met Lys Leu 
210 215 220 

Leu Gly Glu Ser Val Gly lie Phe Gly. Thr Ala Val He Leu Ala Thr 
' 225 . 210 235 240 

Asp Gly ' ■ , 



<210> 132 , . . ' ' , 

<211> 313 " , * 

<212> FRT 

<213> Homo sapiens 

<220> \ . 

<22l> 'SITE 
<222> (313) 

<223> Xaa equals stop translation 
<400> 132 .' t . 

Met Glu Ser Leu Tyr Asp Leu Tro Glu Phe Tyr Leu Pro Tyr Leu Tyr 
1 5. * ■ 1 10 • 15 

Ser Cys He Ser Leu Met Gly Cys Lsu- Leu Leu Leu Leu Cys Thr 'Pro 
' 20 25 30 



Val Gly Leu Ser Arg Met Phe' Thr Val Met Gly His Leu Leu Val Lys 
,35 40 45 
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Pro Thr lie Leu Glu. Asp Leu Asp Glu Gin He Tyr He He Thr Leu, 
50 ■ •• • - .55 • 60 

Glu Glu Glu Ala Leu Glri Arg Arg Leu Asn Gly Leu Ser Ser Ser Val 
65 70 '.75 bo. 

Glu Tyr Asn Tie Met Glu Leu Glu Gin Glu - Leu Glu Asn Val Lys Thr 

85. 90 ' \ - ■ 95 

Leu Lys Thr Lys Leu Glu Arg Arg Lys Lys Ala Ser Ala Trp Glu Arg 
10 0 105 u HO 

Asn Leu Val Tyr Pro Ala" Val Met Val Leu Leu Leu He Glu Thr Ser 
115 120 ■ -125 •' " 

He Ser Val Leu Leu Val Ala Cys Asn He jeu Cys Leu Leu Val Asp 
130 v ,135 140 ' 

Glu Thr Ala Met Pro Lys Gly Thr Arg GJy Pro Gly Tie Gly Asn Ala 
14 5 150 155 160 

Ser Leu Ser Thr Phe Gly Phe Val Gly Ala Ala Leu Glu He lie Leu 
165 170 175 

He Phe Tyr Leu Met Val Ser Ser Val Val Gly Phe Tyr Ser Leu Arg 
180' 185 190 

Phe Phe Gly Asn Phe Thr Pro Lys Lys Asp Asp Thr Thr Met Thr Lys 
195 200 . •• 205 

lie. He Gly Asn Cys Val Ser He Leu Val Leu Ser Ser Ala Leu Pro 

.210 ' 215 ' 220 

Val Met Ser Arg Thr Leu Gly He Thr Arg Phe Asp Leu Leu Gly Aso 
. 225 230 235 240 , 

Phe Gly Arg phe Asn Trp Leu Gly Asn ?he 'iyr He Val Leu Ser Tyr 
245 250 . 255 

Asn Leu Leu Phe Ala lie Val- Thr Thr Leu. Cys Leu Vai Arg Lys Phe 
2 *0 ( 265 . . 270 . 

Thr Ser Ala Val Arg Glu Glu Leu Phe Lys Ala Leu Gly Leu His Lys 
275 .280 . . 285 

Leu His Leu Pro Asn Thr Ser Arg Asp Ser Glu Thr Ala Lys- Pro Ser ■ 
29 ° 295 300 

Val Asn Gly His Gin Lys Ala Leu Xaa - - 
305 310 



<210> 133 
<211> 183 
<212> PPT 

<213> Homo sapiens 



<220> 
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<22l> SITE 

<222> (183) > 

<223> Xaa equals stop translation 

<400> 133 ' 

Met Met Val Cys Ser He Met Met Tyr Phe Leu Leu Gly He Thr Leu 
1 5 10 15 

Leu Ary Ser Tyr Met Gin Ser Val Trp Thr Glu Glu Ser Gin Cys Thr 

r 20 * . 25 30 ,,f 

Leu Leu Asn Ala Ser He Thr Glu Thr Phe Asn Cys Ser Phe Ser Cys ' 
35 t '40 ' 45 

Gly Pro Asp Cys Trp Lys Leu Ser Gin Tyr Pro Cys Leu Gin Val Tyr 
, 50 , 55 60 

Val Asn Leu Thr Ser Ser Gly. Glu jys Leu Leu Leu Tyr. His Thr Glu 
65 70 • . 75 • . • < ,80 

Glu Thr He Lys He Asn GlnLys Cys Ser Tyr He Pro Lys Cys Gly 
85 ' '? 90 95 ' ' ' 

Lys Asr.. Phe Glu Glu Ser Ke~ Ser Leu Val Asn Val Val Met Glu Asn 
\ 10 0 ' 105 110 

Phe Arg Lys .Tyr Gin Hie Phe Ser Cys Tyr Ser Asp Pro Glu Gly Asn 
115 120 125 1 

Gin Lys Ser Val He Leu Thr Lys Leu Tyr Ser Ser Asn Val Leu Phe 
130 135 140 ■ 

*;His Ser Leu Phe Trp Pro Thr Cys Ker Met Ala Gly Gly Val Ala lie 
' i45 - 150 155 ' 160 

Val Ala Met Val Lys Leu Thr Gin Tyr Leu Ser Leu Leu Cys Glu Arg 
165' 170 ■ : 175 

lie Gin Arg He Asn Arg Xaa 
180 



<210> 134 ' • ' 

<211> 147 ■ 

<212> PRT . 
<213> Homo sapiens * < 

<220> " ■ • . 

<221> SIT£ 

<222> (147) . *' ■ ■ * t ■ 

<223> Xaa equals stop translation . 

<4O0> 134 

Met Trp Lys Leu. Trp Arg Ala Glu Glu Gly Ala Ala Ala Leu Gly Gly 

1 5 • ' : 10 • ■' • 15' 

Ala Leu Phe Leu Leu Leu Phe Ala Leu Gly Val Arg Gin Leu Leu Lys ' 
' . • 20 25 . 30 
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Gin Arg Arg Pro Met Gly Phe Pro 
35 40 

He Gly Asn He Tyr Ser Leu Ala 
50 ' 55 . 

Tyr Met Arg Lys Gin ,Ser Gin Val 

Ala Pro Gly Arg Glu Gly Arg Gin 
85 

Ser Trp Leu Asp Leu Trp Pro Pro 
100 

Pro Cys Ala Gly Cys Pro Lou Arg 
115 120 

Gly Arg Ser Pro Arg Arg Arg Ala 

130 P . <■ 135 



Pro Gly Pro Pro Gly Leu Pro Phe' 
45 

Ala Ser Ser Glu Leii Pro His Val 
.60 

Tyr Gly Glu Val Gin Pro Arg Arg 

75 80- 

Ala Gly Pro Gly Trp Pro Gly Pro 
90 95 

Leu Gly Arg Leu Val Gly Thr Ser 
105 . 110 

Asp Thr, Arg Phe Pro Cly Leu Glu 
125 . 

'Pro Leu Gin Gly Glu ! Pro Arg Pro 
140 



Cys Arg Xaa , .■. . ' 

-.145 1 • • 

<210> 135 •.. , ' 

<211> 122 

<212> PRT ' 
- -<213> Homo sapiens 

<400> 135 

Met Arg Val Arg lie Gly Leu Thr Leu Leu Leu Cys Ala Val Leu Leu 

1 ' 5 . 1C • .15 , 

Ser Leu Ala Ser Ala Ser Ser Asp Glu Glu Gly Ser Gin Asp Glu Ser 
20 25 , 30 

Leu Asp Ser Lys Thr Thr Leu Thr Ser Asp Glu Ser Val Lys Asp His 
35 * 40 45 

Thr Thr Ala Gly Arg Val Val Ala Gly Gin lie Phe Leu Asp' Ser Glu 
' SO 55 60 

Glu Ser Glu Leu Glu Ser'Ser lie Gin Glu Glu Glu Asp Ser Leu Lys 
65 70 75 80 

Ser Gin Glu Gly Glu Ser Val Thr Glu Asp He Ser Phe Leu Glu Ser, 
,85 90 95 

Pro Ash Pro Glu, Asn l.,ys Asp Tyr Glu Glu Pro Lys Lys Val Arg Lys 

100 '. >.;■...• 105 ' ' 110 

Pro Gly ser Leu' Asp He Phe Leu Ala Phe 
US • 120 



<210> 136, 
<211> 112 
<212> PRT 
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<213> Homo sapiens . 
<400> 136 

Met Ala Arg Gly Ser Leu Arg Arg Leu Leu Arc Leu Leu Val Leu Gly 
r 1 5 10 15 

Leu Trp' Leu Ala Leu Leu Arg Ser Val Ala Gly Glu Gin Ala Pro Gly 
20 25 30 

Tfcr Ala Pro Cys Ser Arg Gly Ser Ser Trp Ser Ala Asp Leu Asp Lys 
35 ' 40 45 

Cys Met Asp Cys Ser Thr Ser Cys Pro Leu Pro Ala Ala Leu Ala His' 
50 55 60 

Pro Trp -Gly Arg Ser 4 Glu Pro Asp Leu Arg Ala Gly Ala Ala Phe Trp 
65 70 75 ■ 80 

Leu Phe Gly Leu Glu Thr Met Pro Gin Arg Glu Lys Phe Thr Thr Pro 
85 ' 90 95 

He Glu Glu Thr Gly Gly Glu Gly Cys Pro Ala Val Ala Leu He Gin 
100 . 1Q5 110 



<210> 137 
* <211> 140 
<212> PR*? * 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (140) 

<223> Xaa equals stop translation 

<400> 137 

Met Leu Leu Gly Pro Val Pro He Leu His lie Lys Ser Gin Leu Trp 
1 • 5 10 15 

Leu Leu Val Leu He Leu Val Val Ser Gly Leu Ser Ala Gly Met Ser 
20 25 30 

He He Pro Thr Phe Pro Clu lie Leu Ser Cys Ala His Glu Asn Gly 
.35 • 40 45 

Phe Glu Glu Gly Leu Ser Thr Leu Gly Leu Val Ser Gly Leu Phe Ser 
50 • ' 55 60 

Ala Met Trp Ser He Gly Ala Phe Met Gly Pro Thr Leu Gly Gly Phe 
. 65 70 75 80 

Leu Tyr Glu Lys He Gly Phe Glu Trp Ala Ala Ala lie Gin Gly Leu 
85 90 95 

Trp Ala Leu lie Ser Gly Leu. .Ala Met Gly Leu Phe Tyr Leu Leu Glu 
100' * 105 110 
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Tyr Ser Arg Arg, Lys Arg Ser Lys Ser Gin Asr. lie Leu Ser Thr Glu 

1.15 120 •' 125 

Glu Glu Arg Thr Thr Leu Leu Pro Asn Glu Thr Xaa 

130 ' 135 140 



<210> 138 

<2ll> 404 

<212> PRT 

<213> Homo sapiens 

<400> 138 

Met Arc Leu Gin Asp Val Tyr' Met Leu Asn Val Ly3 Gly Leu Ala Arg 
1 5 10 15 

Gly Val Phe Gin Axg val Thr Gly Ser Ala lie Thr Asp Leu Tyr Ser 
20 25 30 

Pro Lys Arg Leu Phe Ser Leu Thr Gly Asp Asp Cys Phe Gin Val Gly 
35 40 45 

Lys Val Ala Tyr Asp Met Giy Asp Tyr Tyr His Ala lie Pro Trp Leu 
50 55 60 

Glu Glu Ala Val Ser Leu Phe Arg Gly Ser Tyr Gly Glu Trp Lys Thr 
65 70 75 80 

Glu Asp Glu Ala Ser Leu Glu Asp Ala Leu Asp His Leu Ala Ph© Ala 
• 85 90 95 

-Tyr Phe Arg Ala Gly Asn Val Ser Cys Ala Leu Ser Leu Ser Arg Glu 
100 105. 110 

Phe Leu Leu Tyr Ser Pro Asp Asn Lys Arg Met Ala Arg Asn Val Leu 
115 • • 120 • 125 

Lys Tyr Glu Arg Leu Leu Ala Glu Ser Pro Asn His Val Val Ala Glu 
130 135 , 140 

Ala Val He Gin Arg Pro Asn He Pro His Leu Gin Thr Arg Asp Thr 
14 5 t 150 ' 155 160 

Tyr Glu Gly Leu Cys Gin Thr Leu Gly Ser Gin Pro Thr Leu Tyr Gin 
165 ' 170 175 

He Pry ser Leu Tyr Cys Ser Tyr Glu Thr Asn Ser Asn Ala Tyr Leu 
180 185 j ■ 190 

,Leu Leu Gin Pro lie Arg Lys Glu Val lie His Leu Glu Pro Tyr He , 
195 200 205 

Ala Leu Tyr His Asp Phe Val Ser Asp Ser Glu Ala Gin Lys He Arg 
210 ' 215 220 

Glu Leu Ala Glu Pro Trp Leu Gin Arg Ser Val Val Ala Ser Gly Glu , 
225 • '230 235 . 240 
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Lys Gin Leu Gin Val Glu Tyr Arg lie Ser Lya Scr Ala Trp Leu Lys 
245 . 250 255 • ' 

Asp Thr Val " Asp Leu Lys Leu Val Thr Leu Asn His Arg lie Ala Ala 
260 . 265., - 270 

„ Leu Thr Gly Leu Asp Val Arg Pro Pro Tyr Ala Glu Tyr Leu Gin Val 
275 280 285 ' 

Val Asn Tyr Gly lie Gly Gly His Tyr Glu Pro His Phe Asp His Ala 
290 L 295 • 300' 

Thr Ser Pro Ser Ser Pro Leu Tyr Arg Met Lys Ser Gly Asn Arg Val 
305 - 310 315 320. 

Ala Thr Phe Met He Tyr Leu Ser Ser Vol Glu Ala Gly Gly Ala Thr 
< 325 33Q 335 

Ala Phe He Tyr Ala Asn Leu Ser Val Pro Val Val Arg.* Asn Ala Ala 
340 . t 345 350 

Leu Phe Trp Trp Asn Leu His Arg Ser Gly Glu Gly Asp Ser Asp Thr 
, ' 355 360 365 . 

Leu His Ala Gly, Cys Pro Val Leu Val Gly Asp Lys Trp Val Ala Asn 
370 375 380 

■Lys Trp He His Glu Tyr Gly Gin Glu Phe Arg Arg Pro Cys Ser Ser 
385 . 390 395 400 

Ser Pro Glu Asp 



<210> 139 

<211> 96 

. <212> PRT ■ , r ■ t ' 

<213> Homo sapiens 

<220> 

t <221> SITE ' 

l <222> (96) , 

<223> Xaa. equals stop translation 

<400> 139 ■ 

Met Lys Ala Pro His Thr Gly Val Leu His Leu Gly Ser Val Trp Val 

1 • ' 5 t 10 ' 15 

Phe Leu Gly Pro Phe Leu Leu Gly Val Gly Tyr Thr I.eu Thr Phe Asn 
20 25 30 

Trc Leu Ser Gly Cys Met ser Thr Val Arg Trp Leu Asn Ser Asn He 
35 40 • 45 . 

Thr Ala Asn Arg Thr Leu Ser Arg Ser Vai Cys His Val Thr Pro Leu 



50 



55 60 



His Arg Ser Leu Ser Pro His Asp Gly Glu Tyr Leu Arq Gin Met Leu 



65 



70 75 BC 
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Leu Asn Ser Ser Ser Arg Ala Gly Glu Ala Gly Ser Trr> Gly Tyr Xaa 
65 : * 90 . * 95 



<210> 140 
<211> 240 
<212> PRT 

<213>.Homo 3apiens 
<220> 

<221> SITE . 
' <222> (240) 
<223> Xaa equals stop tiranslat ion. 

<400> 140 

Net Gly Ser Cys Ala Arg Leu Leu Lou Leu Trp Gly Cys Thr Val Val 
1 .5 10, 15 

Ala Ala Gly Leu Ser Gly Val Ala Gly Val Ser Ser Arg Cys Glu Lys 
20 .25 ( 30 

: Ala Cys Asn Pro Arg Met Gly 'Asn Leu A*a Leu Gly Arg Lys Leu Trp 
35 ,40 * 45 

Ala Asp Thr Thr Cys Gly Gin Asn Ala Thr Glu Leu Tyr Cys Phe Tyr 
50 . ' 55 SO 

Ser Glu Asn Thr Asp Leu Thr Cys Arg Gin Pro Lys Cys Asp Lys Cys 
65 70 .75 80 

Asn Ala Ala Tyr Pro His E.ftu Ala His , Leu Pro Ser Ala Met Ala Asp 

B5 90 - ,, . 95 

Ser Ser Phe Arg Phe Pro Arg Thr Trp Trp Gin Ser Ala Glu Asp Val 
100 105 110 

His Arg Glu Lys lie Gin Leu Asp Leu Glu Ala- Glu Phe Tyr Phe Thr 
115 120 • 125 

His Leu He Val Me- Phe Lys Ser Pro Arg Pro Ala Ala Met Val Leu 
• ' 130 , 135 140 

Asp Arg Ser Gin, Asp Phe Gly Lys Thr Trp Lys Pro Tyr Lys Tyr Phe 
145 150 155, 160 

Ala Thr Asn Cys Ser Ala Thr Phe Gly Leu Glu Asp Asp Val Val Lys 
165 ' 170 ' 175 

Lys Gly Ala He Cys Thr Ser Lys Tyr Ser Ser Pro Phe Pro Cys Thr 
,180 185 . 190 

Gly Arg Lys Val He Phe Lys Ala Leu Ser Pro Pro Tyr Asp Thr 31u 
195 200 . 205 

Ash Pro Tyr Ser Ala Lys Val Gin Glu Gin Leu' Lys He Thr Asn Leu 
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210 



215 



Pro Arg Ala Ala Ala Glu Thr Thr Val Leu Ser Leu Ser Glu Lys Xaa 
225 230 235- 240 



<210> 141 , 

<211> 54 - ' . 

<212> PRT • ( .. • 

<213> Homo sapiens 

<220>' ' ■•*..-■..* 

<221> SITE . -V 

<222> (54) ' » . . 

<223> Xaa equals stop translation 

<400> 141 

Met Met lie Ser Gly Leu Lys Leu Leu Val Leu Phe Leu Lys Phe Ala 
1 5 .10 15 

Pro Glu Asn Tyr Cys Leu Ser Thr Glu Thr Leu Gin Met, Pro Asn Arg 
20 25 ' 30 

His Leu Arg Leu Ser Lys Ala Thr Cys Tyr Lcii Met Ly3 Cys Leu Leu 
35 40 .45 

Pro Ser Tyr Phe Glu Xaa 

50 ' ' 



<210> 142 :', .. 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> ' . r 

<221> SITE 51 

<222> (67) 

<223> Xaa- equals stop translation 

<400> 142 ' . ' ' . ' 

Ket Arg Ser Leu lie Ser Ser His Pro Cys Gin His Leu Leu Leu Leu 
1 5 ■ 10 ' 15 

'Leu Leu Leu Leu Phe Leu 11* Leu Ala -lie Leu Val Asp Val> Lys Trp 
20 • 25 30 

Tyr Leu Val Leu Phe lie Cys lie Sar Leu Met Thr Ser Asp Val Glu 
35 40 45 ! 

His Leu Phe Met Cys Leu Leu Ala He Arg lie Ser Ser Trp Arq Asn 

50 55 Ji 60 " 

Val Tyr Xaa' 
65 
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<210> 143 

<211> 108 * , 

<212> PRT 

<213> Homo sapiens 

<220> .. ' ■ 

<221> SITE '• 
' <222> (48) 

<223> Xaa equals any of the naturally occurring L -ami no acids 

<220> > 

<221> SITE . 

. <222> (55) . ' 

<22 3> Xaa equals any cf the naturally occurring L -amino acids 

<22C> ' 
■<221> SITE 
<222> (58) 

<223> Xaa equals any of the naturally occurring F.-aminrv acids 

<220> • 
<221> SITE 
<222> (67) 

<223> Xaa equals any of the. naturally occurring L-amino acids 
<400> 143 

Leu Thr Leu lie Leu Cys Glu Leu Ser val Ala Gly* 
5 ' 10 15 

Ala Ser Gin Arg Pro Leu Gin Arg Leu Pro Arg Hie 
• 25 30 

Arg Asn Pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa. 
40 45 

Trp Gly Xaa Pro Asp Xaa Gin Phe Pro Gly' Cys Pro 
55 60 

Val Thr Leu Asn Ala Arg Gin Met 'Gly Asn Gly Lys 
,70 75 . 80 

Ala Asp Leu Lys Leu Lys Phe Pro Gin Lys Arg Phe 

,85 90 ..95 

Phe Ser Glu Arg lie Lys Ala Phe 
105 



<210> 144 

<211> 84 , " 

<212> PP.T 

<213> Homo sapiens 

<220> ' .. ( - 
<221> SITE 
<222> (84) 

<223> Xaa equals step translation 



Met Phe Tyr Lys, 
1 



Val Thr Gin Ala 
20 



He Cys Ser Gin 
35 

Leu 'Gin Thr Thr 
50, , 

His Pro Xaa Arg 
-65 



Glu Ly3 Lys Ala 



Tyr Leu Ser Ala 
100 
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<400> 144 

Met Ala Ser Val. Gly Thr Thr Leu Val Ser Pro Leu Leu Cys Leu Leu 
1 • ' 5 10 15 

lie Pro Thr Arg .Val Ser Asp Pro Trp Leu Gin Asn Thr Pro Leu His 

20" ■'■ 25 • 30' 

Pro Trp Lys Thr lie Thr lie lie Asp Tyr Tyr Leu Ser Leu Gly Phe 
35 40 45 

Leu Gly Trp Thr Gly Leu Ser Trp Val Val His Phe Gly Ala Ser Ala. 
50 55 60 

Val Met Gly Arg Gin Trp Leu Gly Ser Leu Gin Arg Leu Pro, Cys lie 
65 70 75 . ' 80 

Ser Gly Ser Xaa . ' 



<210> 145 • 
<211> 166 
<212> PRT 

<213> Homo sapiens , 
<400> 145 . ' ■ ' 

Met Gly Ser Arg Phe Leu Leu Val Leu Leu . Ser Gly Leu Thr Val Leu 
1 5 ,10 -15 

Leu Ala Leu Pro Gly Ser Glu Ala Lys Asn Ser Gly- Ala Ser Cys Pro 
20 25 30 

Pro Cys Pro Lys Tyr Ala Ser Cys His Asn Ser Thr His Cys Thr Cys 
35 40 45 

Glu Asp Gly Phe Arg Ala Arg Ser Gly Arg Thr Tyr Phe His Asp Ser 
50 55 60 ' » _ 

Ser Glu Lys cys Glu Asp, He Asn Glu Cys Glu Thr Gly Leu Ala Lys 
65 70 75 80 . 

Cys. Lys Tyr Lys Ala Tyr Cys Arg Asr. Lys Val Gly Gly Tyr lie Cys 
'85 90 . 95 

Ser Cys Leu Val Lys Tyr Thr Leu Phe Asn Phe Leu Ala Gly lie He 
100 105 ' 110 

Asp Tyr Asp His Pro Asp Cys 'iyr Glu Asn Asn Ser Gin Gly Thr Thr 
115 120 125 

Glh Ser Asn Val Asp lie Trp Val Ser Gly Val Lys Pro Gly Phe Gly 
130 135 140 

Lys Gin Leu Val Arg He Thr Met Pro Phe Ser Tyr Pro Asn He Asn 
145 - ISO • , 155 ' " ' 160 

Met Ser Ser Cys Asp Phe ' 
165 
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<210> 146 ■ -' 

<211> 70 * 
<212> PRT 

<213> Homo sapiens 
<400> 146 

Met Lys Pro Lys His Leu Glu Trp Cys Leu Ala His aer Trp Cys Val 
1 5 ■ 10 ' ( 15 

He- Trp Leu Ser Phe Val Ser Pro Pro Thr . Ser His Leu Glu Cys Asp 
20 25 30 

Gly Phe Pro Gly Ser Leu Leu Pro Pro Cys Glu Glu Gly Arg Cys Phe 
35 40 45 

Pro Phe Thr Phe His' His His Asp Cys His Gly Cys Ser Pro Leu Gin 
50 55 t , 60 

Ser Ser Pro Gly Gin His - 
65 70 



<210> 147 

<211> 412 

<212> PRT 

<213> Homo sapiens 

<400> 147 

Met Cys Cys Trp Pro Leu Leu Leu Leu Trp Gly Leu Leu Pro Gly Thr 
l . .5 10 15 

Ala Ala Gly, Gly Ser Gly Arg Thr Tyr Pro His Arg Thr Leu Leu Asp 
'"20 25 3C 

Ser Glu Gly Lys Tyr Trp Leu Gly Trp Ser Gin Arg Gly Ser Gin He 
35 • 40 45 

\Ala Phe Arg ? Leu Gin Val Arg Thr Ala Gly Tyr Val Cly Phe Gly Phe 
50 " ' 55 60 

Ser Pro Thr Gly Ala Met Ala Ser Ala Asp He Val Val Gly Gly Val 
. 65 70 75 > * 80 

Ala His Gly Arg Pro Tyr Leu Gin Asp Tyr Phe Thr Asn Ala Asn Arc 

• 85 90 ; ■ 95 ' * " 

Glu Leu Lys Lys Asp Ala Gin Gin Asp Tyr His Leu Glu Tyx Ala Met 

' "100 105 , . ■ HO 

Glu Asn Ser Thr His Thr He He Clu Phe Thr Arg Glu Leu His Thr 
115 ^ 120 '125 

Cys Asp He Asn Asp Lys Ser He Thr Asp Ser Thr Val Ara Val Tie 
130 135 140 

Trp Ala Tyr His His Glu Asp Ala Gly Glu Ala Gly Pro Lys Tyr His 
145 



150 - , 155 . 160 
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Asp Ser Asn Arg- Gly Thr lys Ser Leu Arg Leu Leu Asn Pro Glu Lys 
1 165 170 175 ' 

'* ■._' '■ ■. ' 

Thr Ser Val Leu Ser. Thr Ala Leu Pro Tyr Phe Asp Leu Val Asn Gin 
180 185 ' 190 

Asp yal Pro lie Pro Asn Lys Asp' Thr Thr Tyr Trp Cys Gin Met Phe 
195 200 205 

Lys lie Pro Val Phe Gin Glu Lys His Kis Val He Lys Val Glu Pro 
, 210 ' 215 220 

Val He Gin Arg Gly His Glu Ser Leu Vol His His He Leu Leu Tyr 
225 . 230 ' 235 240 

Gin Cys Ser Asn Asn Phe Asn Asp Ser Val Leu Glu Ser Gly His Glu 

245 . . 250 ' _ 255 ,-■ 

Cys Tyr Kis Pro Asn Met Fro Asp Ala Phe Leu Thr Cys Glu Thr Val 
'260 265 270 

lie Phe Ala' Trp Ala He Gly Gly Glu Gly ?he Scr Tyr Pro Pro His 
275 280 285 

Val Gly Leu Ser Leu Gly Thr Pro Leu Asp Pro His Tyr Val Leu Leu ' 
290 295 300 

Glu Val His Tyr Asp Asn Pro Thr Tyr Glu Glu Gly Leu He Asp Asn 
30* " 310 • 315 ■ 320 

Ser Gly Leu Arg Leu Phe Tyr Thr Met Asp Tie Arg Lys Tyr Asp Ala 
325 330 - 335 

Gly Val He Glu Ala Gly Leu Trp Val' Ser Leu Phe His Thr lie Pro 
.. 340 345 350 

Pro* Gly Met Pro Glu Phe Gin Ser Glu Gly His Cys Thr Leu Glu Cys 
355 ' ■'. 360 ' 365 

Leu Glu Glu Leu Trp Lys Pro Lys Ser 'Gin Val Glu Phe Met Cys Leu 
370 375 380 

Leu Phe Phe Ser Met Leu Thr Trp Leu Ala Glu His Gin Ala Ala Ser 
385 390 . 395 400 

Phe Ser Lys Arg Glu Gly Asn Glu tie Thr Cys Leu 
405 410 



<210> 148 : 

<211> B5 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
■ <222> , (85) , ■ 
<223> Xao equals stop translation 
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<400> 148 * 

Met Asn Val Phe. Leu Pro Pro Ala Leu Gly Thr Trp Gly Val Ala Arg 
1 5 10 * 15 , 

Phe Phe" Pro His Leu Val Pro Glu Arg Trp Cys Leu Val Phe Cys' Cys 
20 25 30 

Trp lie Phe Phe Phe Phe Phe Phe Phe Cys Thx Lys Val Ala Thr Arg 
35 • . 40 45 

Ser Val Leu Gly Asp Gin Ala Gly ,Leu Gly Val Gly Gly Pro His Leu- 
• 50 55 60 ' 

Pro Leu Pro Gly Ser His Ser Val Ser Val Pro Glu Lys Thr lie Phe 
65 70 75 80 

Ser Leu Lys Gin Xaa 
'85 



<210> 149 . . 
<211> 154' ■ ' 
<212> PRT 

<213> Ho hio sapiens 

<220> ' "• ' • ' 

<221> SITE 
<222> (154) 

<223> Xaa equals stop translation , r ' 

<400> 149 

Met Gly Arg Leu Pro Leu Leu Arg Arg Val Leu Lys Gly Leu Gin Leu 
1 5 , 10 15 

Leu -Leu Ser Leu Leu Ala Phe lie Cys Glu Glu Val Val Ser Gin Cys 
20 25 30 - 

Thr Leu Cys Gly Gly Leu Tyr Phe Phe Glu Phe Val Ser Cys Ser Ala 
35 _ 40 45 ; 

Phe Leu Leu Ser Leu Leu lie Leu lie Val Tyr Cys Thr Pro Phe Tyr 

• 50 55 60 . * 

Glu Arg val Asp Thr Thr Lys Val Lys Ser Ser Asp Phe Tyr lie Thr 

65 70 J - 75 B0 \, 

Leu Gly Thr Gly Cys Val . Phe Leu Leu Ala Ser lie lie Phe Val Ser 
85 ' 90 95 

Thr His Asp Arg Thr Ser Ala Glu, He Ala Ala He Val Phe Gly Phe 
100 105 . 110 

He Ala Ser Phe Met Phe Leu Leu Asp Phe lie Thr Mot Leu Tyr Glu ^ 
-115 120 125 

Lys Arg Gin Glu Ser Gin Leu Arg Lyo * Pro Glu A3n Thr Thr Arg Ala « 

130 135 140 i 

Glu Ala Leu Thr Glu Pro Leu Asn* Ala Xaa . 
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145. 



<210> ISO 
<211> 130 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (130) 

<223>. Xaa equals stop translation 



Pro Ala Leu Ser Gly Leu Ala Ser 
' 10 15 

Ala Thr Phe Ser Ala Gin Leu Pro' Gly Pro Val Ala 
25 - .30 



,<400> 150 
Met Arg Gly His 
1 

- Val Cys' Leu Trp 
20 

Ala Thr- Ser Trp 
35 

Gly Pro Glu Lys 
50 



Leu Ala Gin Ala 
65 



. Pro Ala Pro Ala 



Leu Ph«* Asp Gly 
100 

Ser Thx Asp Val 
115 

Glu Xaa 
130 



Leu Ala Gly Phe 

5 



Thr Pro Ala Pro 
40 



-eii Met Asp Thr 
120 



Leu Gly Cys Ser 



Arg Glu Ala Giu 



Ala Ala Arg Ser 
45 

Ser Ala Ala Ser 



Leu Pro Val Asp 
83 

i 

Trp Leu Gly Gly 
95 

Phe Leu Arg Lys 
110 

Ser ' Leu Glu Val 
125, 



Arg Leu Gly Thr Ala Ala Pro Gly 
55 ' 60 

Gly Pro Gly Ala Pro Cys Arg, Val 
70 ., 75 

Ala Leu Asn Val Arg Glu Pro Gly 

85 90., 

Ala Leu' Leu Gin Val Leu Leu -Asn 
105 



<210> 151 

<211> 62 , ' 

<212> PRT 

<213> Homo sapiens 

<220> ' ' \ ' ' ' . ' 

<221> SITE 

<222> (62) ■ * 

<223> Xaa equals step translation 

<400> 151 

Met Leu Phe Trp Ala Tyr Pro lie. Cys Val Phe lie Asp Ser Leu Ser 
1 5 10 15 



Cys Gin Pro Cys Leu Trp Ser Thr Gly Ala Thr Ser His Phe Asn Ser 

20 .25 '30 
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Pro Thr Thr Ser Pro Leu The Thr l Leu Phe Met Pro Cys Ala Leu Ala 
35 40 45 



. Pro 'Asn^ Pro Phe. Thr, Gin Leu Gly Lys Leu Asp Asp Arg Xaa 
50 55 60 



<2i0> 152 
<211> 225 

<212> PRT f , 

<213> Homo sapiens ' 

<220> ' ' ' 

<221> SITE 
<222> (225) 

<223> Xaa equals stop translation 
<400> 152 

Met Gly lie Phe Pro Gly He He Leu He Phe Leu Arg Val Lys Phe 

] ' , S\ ' 10 15 . 

Ala Thr Ala Ala val He Val Ser Gly His Gin Lys Ser Thr Thr Val 
20 25 30 ■ 

Ser His Glu Met Ser Gly Leu Asn Trp Lys Pro Phe Val Tyr Gly Gly 
35 . . 40 45 

Leu Ala Ser He Val Ala Glu Phe Gly Thr Fhe Pro Val Asp Leu Thr 
50 55 60 

Lys Thr Arg Leu Glh Val Gin Gly Gin Ser lie Asp Ala Arg 'Phe Lys 
65 70 ; '75 80 

Glu He Lys Tyr Arg Gly Met Phe His- Ala Leu Phe Arg He Cys Lys 
' -85 90 ■ 95 

Glu. Glu Gly Val Leu , Ala Leu Tyr Ser Gly lie Ala Pro Ala Leu Leu. 
100 ' 105 ~ 110 

Arg Glh Ala Ser Tyr Gly ..Thr He Lys He Gly He Tyr Gin Ser Leu 
US 120 125 . 

Lys Arg Leu Phe Val Glu Arg Leu Glu Asp Glu Thr Leu Leu lie Asn 
130 ■ 135 140 

Met lie, Cyp Gly Val- Val' Ser Gly' Val: He .Sex Ser Thr He Ala Asn 
145 150 . ■ " 'l55 160 

Pro Thr Asp Val Leu Lys He Arg Met Gin Ala Gin Gly Ser Leu Phe 
165 ' 170 175 

Gin -Gly Ser Met He Gly Ser Phe lie Asp He Tyr Gin Gin Glu Gly 

' ' ' 180 ( ' 185 - - ,190 

Thr Arg Gly Leu Trp Arg Val Ser Thr Leu Phe Leu Leu Leu Ser Tyr 

195 200 , . '205 

Thr Leu Ser Ser Tyr Asn Leu Gin .Arg lie Phe Phe Tyr He Lys Thr 

' 210 215 * : • 220 
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Xaa 
225 



<210> 153 ' 

<211> 69 

<212> PRT 

<213> Homo sapiens 

<220> 1 . •• ' - 

<221> SITE , 
<222> (€9) 

<223> Xaa equals stop translation 

<400>153 ' . ' " 

Met Leu Met Leu Leu Thr Leu Leu Val Leu Gly Met Vctl Trp Val Ala 

• 1 .5 10 " 15 

Ser Ala He Val Asp Lys Asn Lys Ala Asn Arg Glu Ser Leu Tyr Asp 
20 25 30 

Phe Trp Glu Tyr Tyr Leu Pro Tyr Leu Tyr Ser Cys He Ser Phe Leu 
• 35 40 45 

Gly Val Leu Leu' Leu Leu Ala Ala Gly Arg pro Gly Gly Ala Ala Val 
'50 55 60 

Leu Leu Ser Leu Xaa 
65 



<210> 154 

<211> 84 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (84) 

<223> Xaa equals stop translation 
<430> 154 

Met Tyr Gly val Cys Leu Cys Val He 
1 5 

Leu Cys Leu Tyr Val Trp Gly Val Ser 
20 25 

Phe Met Cys Val Cys Leu Cys Val' lie 
" 35 • 40 

Arg Thr Glu His Tyr His Ser Pro His 
50 55 

Arg Glu Met Cys Gly Arg His Asp Val 
65 70 

Ser Tyr Val Xaa 



Val Cys Val Ser Gly Val Ser 
10 15 

Val Cys Asp Cys Val Ser Val 
30 

Phe Cys Val Tyr Gly Lys Pro 
45 

Leu Ala Lys Gin Lys Ala Phe 
60 

Ser Ala Ala Gly Ily Phe Gin 
75 80 
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'<21.0> 


155 




<211> 


61 




<212> 


PRT 




, <213> 


Homo sapiens 




,. <220> 






<221> 


SITE 




<222> 


(61) . 




<221> 


Xaa equals stop translation 




<400> 


155 




Met His Val Leu Leu Phe Ser Phe Leu 


lie 


1 


5 


10 



Pro Val Cly Val Thr Cys Asn Ser His Met Leu Glu Arg Gin Val Ser 
20 1 25 • • 30 

Trp Leu Lys Lys Arg Ser Thr Gin Ala Ser Gin Gin Phe Asn Lys Phe 
35 40 45 

Leu Arg Gly He Ser Asn Val Gly Arg He Val lie Xaa 

■50 ■ ■ * 55 ' 60 

<210> 156 ' • 

<211> 84 • . 
<212> PRT ' ." 
<213> Homo sapiens 

- <22o> • _ . ; ; • 

,<221> SITE ' -' 
<222>'(B4) 

<223> Xaa equals stop translation 
<400> 156 - ' 

Met Cys Leu Leu Val Glu Tyr Ser Leu Met He Leu Thr He lie Pro 
1 5 " 10 15 

'Ser Leu Leu Ser Phe Val Leu Cys Leu Lys Gly lie Lys His Gly Asn 
20 25 1 30 

Tyr. He Phe Gin Thr Pro Leu Pro Glu Gly Tyr Gly Trp Tie Ser Ala 
35 40 45 

Met Ser Gly Leu Cys lie Lys Phe Gly Arg Arg Lys Arg Arg Lys Thr 

50 55 ■ 60 . • ' 

Trp Leu Leu Gin Val Gly Thr Leu Ala' Thr He Asp Thr Glu Phe Ala 

65 70 75 •. •• . 80 

Arg ' Ser Cys Xaa ■ . 



<210> 157 
<21i> 162 
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<212> PRT 

<213> Homo sapiens'' 

. i ' ; ■ , t . . 

<220> .*."", 

<221> SITE ' ( ' " 

<222> (162) - 

<223> Xaa equals, stop translation 

<400> 157 

Met Ala Leu Ser Leu Thr Leu Cys ?he Val Met Phe Trp Thr Pro Asn 
1 . 5 '•' 10 15 . 

Val Ser Glu Lys .He Leu lie Asp lie He f Gly Val Asp Phe Ala Phe 
20 25-30 

Ala Glu Leu Cys Val Val Pro Leu Arg He Phe Ser Phe Phe Pro Val 
35 40 ■ 45 

Fro Val Thr Val Arg Ala His Leu Thr Gly Trp Leu Met Thr Leu Lys 

50 55 60 ' 

Lys Thr Phe Val Leu Ala Pro Ser Ser Val Leu Arg He lie Val Leu 
65 ^ 70 75 80 ' 

He Ala Ser Leu Val: Val Leu Pro Tyr Leu Gly Val His G3y Ala Thr 
85 90 ' 95 

Leu Gly Val Gly: Ser Leu Leu, Ala Gly Phe val Gly Glu Ser Thr Met 
,." 100 105 110 

Val Ala Tie Ala Ala Cys Tyr Val Tyr Arg Lys Gin Lyc Lys Lys Met 

.120 125 

Glu Asn Glu Ser Ala Thr Glu Gly Glu.' Asp Ser Ala Met Thr' Asp Met 
.. 130 . 135 140 

Pro Pro Thr Glu Glu Val Thr Asp He Val Glu Met Arg Glu Glu Asn 
145 150 155 160 

Glu Xaa 



<210> 158 
<211> 146 
<212> PRT 

<213> Homo sapiens ' 
<220> 

<221> SITE . , 

,<222> (96) 

<223> Xae equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE ' 
<222> U07) ' . . •' 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<220> 
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' -> • * ' ' ' . * ; 112 > ' > : " - . V. 

<221> SITE . ' 

■ <222> ' mi.) *. ' ' , . ; ' . 

<223> Xaa equals any of the naturally occurring L-amino acids, • '' 

<22o> ... - " ; . ' , ' ■ . . 

<221> SITE . • . ,' . ;■ , . - ' 1.. 

; <222> (115) • ; , •. \ _ - " \ ' , 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> " : \ ' • " •. 

. <221> .SITE , • ' ' ' *■ : . ' 

• <222> (122) ' . ■ , . " • - ' , ' s ... , 

<223> Xaa equals any of the .naturally occurring L-amino acids ... 

<220> ..• ' ' , • • . ' - 

<22i> site , •' .• . / '" " ; ■ ■ . • , •• * ■• 

<■ <222> (132) ^ ' ' • - ' 

<223> Xaa ecualsany of the naturally occurring L-amino acids " , 

<400> 158 ' ' ' ' " . ' , ' 

Met Glu Pro Gin Leu Gly Pro Glu Ala Ala Ala' -Leu Arg Pro Gly. Trp 
," H 1 5 10 15 . 

; t Leu Ala Leu Leu Leu Trp Val Ser Ala Leu Ser Cys Ser Phe Ser.Leu ' 
- 20 25 30 

Pro Ala Ser Ser Leu Ser Ser Leu Val Fro Gin Val Arg Thr Ser Tyr -. 

• t; 35 . ■ ^ ■ 40 : 45 ■* 

. . "■ ' ■ ' '* ' . . , - . t * . . 

Asn Phe Gly Arg Thr Phe Leu Gly Leu Asp Lys Cys Asn Ala Cys He 

, .50. 55 . * 60 , \ ' ' , ' ' . 

I ,•*-'.,. ' 1 - 1 * 

Gly Thr ser lie Cys Lys- Lys Phe Phe Lys Glu Glu lie Arg Ser Asp 

65 - , . ; .70 ^ " • ■ 75 AO ■ 

Asn Trp Leu Ala Ser His Leu Gly Thr Ala Ser Arg Phe", Pro Leu Xaa /" . ' : 

ss ; .90. ■ ■ : .• 95 ' i; •* • 

Ser Tyr- Pro Cys Lys .Leu Leu Gin Met lie Xaa* Lys lie Trp Xaa Pro/ 

100 . 105 110 - V , 

Cys Gly Xaa Leu Leu Thr Gly Gin Gin Xaa Ser Asn Glu ' lie Ser Lys > .» 
115 ' 120 * 125' ■ < . 

Gin Glu lie Xaa Cys Leu Leu His Pro Pro Pro Lys Asn Leu His He ' -■.» 

130 135 ■ 140 .;' 1 '■ . 

Asp val • • '• • - ' 

145 - " " ■ . ' ••• •' ' • • , . - 



*<21u» 159 1 ' 
; <211> 143 !; , 

<212> PRT 

<213> Homo sapiens 

<220> 
■ <221> SITE 
<222> (143) 
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<223> Xaa equals stop translation ; 

<400> 159 . . • 

Met Trp Trp Ala Val Met Gly Gly Val lie Gly Ser Trp Leu Ser Pro 

10 ' ' 15 , * 

Leu »Ser lie Ala Glu Cys Cys His Asp Leu Trp Thx Ser Gin Ser Cys 
20 25 30 

" , . ' ' ' ■ ' " 

Glu His Ala Gly Ala Leu Cys Gly Asp .Leu Leu Cys Ala Cys Arc Lys 
•35 - 40 /* 45 

Val Gly Val Trp Cys Ala Leu Glh Gin His Trp Trp Asn Arg Cys Val 
' 50 55 60 

Cys Pro His Ala Val lie Arg Val Hi3 Cys- Thr Gly Ala Ser Tyr Thr 
65 70 ' t 75 80 

Leu Gin Lys He Cys Ser Cys Asn Pro Lys 1 Phe Met Gly Arg His Pro 
65 90 95 

His Arg Trp Gin Gin He Arg Lys Cys Ser Gin Pro Val Leu Arg Gly 
100 . . 105 ' 110 

Ser Arg Ala Ala Phe He Trp Val Arg Leu Ala Ala Leu Asn Phe He 
115 120 125 

Ser Ser Phe Arg cys He ser Leu He Ser Tyr. Ser Ala Phe Xaa 
130 135 140 



<210> 160 

<211> 51 - 

<212> PRT . • ' ; ' . . 

<213> Homo sapiens*-^ . 

<220> v . • ' 

<221> SITE * 
<222> (51J • . ' 

<223> Xaa equals stop translation. f 

<400> 160 

Met Lys Val Ser Asp Phe Asn Phe Leu He Phe Leu He Phe Ala Leu 
1 5 10 ' ' 15 

Phe Leu Thr Leu Glu Ala Phe Leu Lys Phe Thr Lys Arg Vol Leii Ala 
20 25 30 

val Val Gly Ash Leu Pro Glu Pro Pro lie lie Lys Thr lie G.ly Phe. 
35 * 40 45 

Leu Tyr Xaa ' ■ . ■ ' , 

50 ' . . ' 



<210> 161 

<211> 65 

<212> PRT / 

<213> Homo sapiens 
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<220> ■ 
<221> SITE 

<222> (65) , 
<223> Xaa equals stop translation 

<400> 161 

Met Val Trp ser Ala Ala Pro Ala Pro Cys Cys Leu Leu Gly Val Leu 
1 5 . 10 15 

Gly Leu' Val Gin Val Leu Gly Ala Gin, Ala Val Gly Pro Trp Thr Ala 
2 0 25 30 

Ser Ala Cys Leu Gly Ala Ala Gin Ala Gin Pro Cys Ara Pro Cys Lys 
35 40 45 

Glu Ser Ser Leu, Arg Leu Phe Ser ' Ala Ser Ala Pro Ser Met Thr His 
50 55 60 

Xaa / 

65 , • 



<210> 162 
<211> 59 . 

<212> PRT ' * 

<213> Homo sapiens 

<220>' > 
<221> SITE 
<222> (59) 

<223> Xaa equals stop translation 
<400> 162 

• Met Glu Lys Tyr Cys Leu Gly Asn Asn Met Leu Ser Arg Phe Cys Leu 
1 5. 10 15 ' 

Phe Leu lie 'Met Leu Leu His lie Leu Leu Phe Leu Val lie Phe He 
20 25 30 

Gin Arg His Thr Val Val SeryLeu Ser Lys His His Pro Phe Val Pro 
35 » ' 40 45 

3 Thr Asn Gly Ser Lys Ser Tyr. Ser Ser Phe Xaa 
50 55 



<210> 163 • 

<211> 374 • ' ' - 

<212> PRT , 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> <94> ' ' - 

<223> Xoa equals any of the naturally occurring L-amino Sj acids 

<220> • • . 

<221> SITE ' 
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<222> (112) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 163 

Met Arg Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val Leu Leu 
1 -5 . 10 .15 

Val Gly Levi Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Gin Pro Val 
20 25 ,30 

Cys Arg Gly Gly Thr Gin Arg Pro Cys Tyr Lys Val lie Tyr Phe His 

,35 •• 40 .4 5 

Asp Thr Ser Arg Arg Leu Asn Phe Glu Glu Ala Lys Glu Ala Cys Arg 
50 55 60 

Arg Asp Gly Gly Gin Leu Val Ser lie Glu Ser Glu Asp Glu Gin Lys 
65 70 75 .30 

Leu He Glu Xaa Phe lie Glu Asn Leu Leu Pro Ser Asp Gly Asp Phe 
85 90 95 

Trp lie Gly Leu Arg Arg Arg Glu Glu Lys Gin Ser Asn Ser Thr Xaa 
100 105 110 

Cys Gin Asp Leu Tyr Ala Trp Thr Asp Gly Ser lie Ser Gin Phe Arg 
115 ,120 125 

Asn Trp Tyr Val Asp Glu Pro Ser Cys Gly Ser Glu Val Cys Val Val 
130 135 140 

Met Tyr His Gin Pro Ser Ala Pro Ala Gly He Gly Gly Pro Tyr Met 
145 150 155 160 

Phe Gin Trp Asn Asp Asp Arg Cys Asr. Met Lys Asn Asn Phe lie Cys 
' 165 . 170 175 

Lys Tyr Ser Asp Glu Lys Pro Ala Val Pro Ser Arg Glu Ala Glu Gly 
180 ' 185 190 

Glu Glu Thr Glu Leu Thr Thr' Pro Val Leu Pro Glu Glu Thr Gin Glu 
195 200 ' 205 

Glu Asp Ala Lys Lys Thr Phe Lys Glu Ser Arg Glu Ala Ala Leu Asn 
210 215 220 

Leu Ala Tyr He Leu lie Pro Ser He Pro Leu Leu Leu Leu Leu Val 
225 230 235 . . 240 

Val Thr Thr Val - Val Cys Trp Val Trp' He Cys Arg Lys Arg Lys Arg 
245 250 ■ 255 

Glu Gin Pro Asp Pro Ser Thr Lys Lys Gin His Thr He Trp Pro Ser 
260 265 270 

Pro His Gin Gly Asn Ser Pro Asp Lsu Glu Val Tyr Asn val He Arg 
275 ' 280 '285 

Lys Gin Ser Glu Ala Asp Leu Ala Glu Thr Arg Pro Asp Leu Lys Asn 
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290 ' 295 300 

lie Ser Phe Arg Val Cys Ser Gly Glu Ala Thr Pro Asp Asp Met Ser 
305 310 315 . . 320 

Cys Asp Tyr Asp Asn Met Ala Val Asn' Pro Ser Glu Ser Gly, Phe Val 
325 330 ' 335 

Thr Leu' Val 'Ser Val Glu Ser Gly Phe Val Thr Asn Asp He Tyr Glu 
340 ■ ■• 345 350 

Phe Ser Pro Asp Gin Met Gly Arg Ser Lys Glu Ser Gly Trp Val Glu 
355 ■ 360 365 

Asn Glu He Tyr Gly Tyr 
370 ' 



<210> 164 ' 

<211> 64 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals stop translation 
<400> 164 ' 

Met His Pro Gin Leu He Pro Ser Val He Ala Val Val Phe Ile.Uu 

1 5 . 10 15 

i ■ 
I . ■ . 

. Leu Leu Gly Val Cys Phe He Ala Ser Cys Leu Val Thr His His Asn 

20 25 * ' 30 , 

Phe Ser Arg Cys Lys Arg Gly Thr Gly Val His Lys Leu Glu His His 
'35 4 0 45 

Ala Lys Leu Lys Cys lie Lys Glu Lys Ser Glu Leu Lys Ser Cys Xaa 
,50 55 60 



<210> 165 
<211> 743 
<-212> PRT 

<213> Homo sapiens 

<220> V ' ' ; . , 

<221>. SITE 

<222> (743) 4 / . . r 

<223> Xaa Equals stop translation' 

<400> 1S5 

Met Ala Val Arg Glu Leu Cys Phe Pro Arg Gin Arg Gin Val Leu Phe 
1 5 10 ... 15 
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Leu Phe Leu Phe Trp Gly Val Ser Leu Ala Gly- Ser Gly Phe Gly Arg 
20 25 30 

Tyr Ser Val Thr Glu Glu Thr Glu Lys Gly Ser Phe Val Val Asn Leu 
35 \ 40 45 

Ala Lys Asp Leu Gly Leu Ala Glu Gly Glu Leu Ala Ala Arg Gly Thr 
50 .55 60 ■' 

Arg Val Val Ser Asp Asp Asn Lys Gin Tyr Leu Leu Leu Asp Ser His 
65 70 75 80 

Thr Gly Asn Leu Leu Thr Asn Glu Lys Leu Asp Arg Glu Lys Leu Cys 
85 t 90 95 

Gly Pro Lys Glu Pro Cys 'Met' Leu Tyr Phe Gin Tie Leu Met Asp Asp 
100 105 110 

Pro Phe Girt lie Tyr Arg Ala Glu Leu Arg Val Arg Asp lie Asn Asp 
115 120 ,125 

His Ala Pro Val Phe Gin Asp Lys Glu Thr Val J Leu Lys lie Ser Glu.' 
130 135 140 

Asn Thr Ala Glu Gly Thr, Ala Phe Arg Leu Glu r Arg Ala .Glh Asp Pro 
145 .• - 150 155 , . 160 

Asp Gly Gly Leu Asn Gly lie Gin ' Asn Tyr Thr lie Ser Pro Asn Ser 
,165 170 175 

Phe Phe His lie Asn He Ser Gly Gly Asp Glu Gly Met lie Tyr , Pro 
180 185 190 

Glu Leu Val Leu Asp Lys Ala Leu Asp Arg Glu Glu Gin Gly Glu Leu 
195 200 205 

Ser Leu Thr Leu Thr Ala Leu Asp Gly Gly Ser. Pro Ser Arg Ser Gly 
210 215 220 

Thr ser Thr Val Arg lie Val Val- Leu Asp Val Asn Asp Asn Ala Pro 
225 230 235 240 

Gin Phe Ala Gin Ala Leu Tyr Glu Thr, Gin Ala Pro Glu Asn Ser Pro 
245 250 255 

lie Gly Phe Leu lie Val Lys V*1 Trp Ala Glu Asp Val Asp Ser Gly 
260 265 270 

Val Asn Ala Glu* Val Ser Tyr Ser Phe Phe Asp Ala Ser Glu Asn lie 
275 .280 285: 

Arg Thr Thr Phe Gin lie Asn Pro Phe Ser Gly, Glu lie Phe Leu Arg- 
290 ' 295 . ' 300 

Glu Leu Leu Asp Tyr Glu Leu Val Asn Ser Tyr Lys lie Asn lie Gin 
3 °5 ,. 310 315 320 

Ala Met Asp Gly Gly Gly Leu Ser Ala Arg Cys Arg Val Leu Val Glu 
325 330 335 
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Val Leu Asp Thr Asn Asp Asn Pro Pro Glu Leu He Val Ser Ser Phe 
340 345 350 

Ser Asn Ser val Ala Glu Asn Ser Pro Glu Thr Pro Leu Ala Val Phe 

355 . 360 * ' 365 v . 

Lys lie Asn Asp Arg Asp Ser Gly Glu Asn Gly Lys Met Val Cys iyr 
. 370 375 380 

lie Gin Glu Asn Leu Pro Phe Leu Leu Lys Pro Ser Val Glu Asn Phe ■ 
38 5 390 395 ' COO 

Tyr He- Leu He Thr Glu Gly Ala Leu Asp Arg Glu He Arg Ala Glu 
405 • . 410 415 

Tyr Asn He Thr He Thr Val Thr Asp Leu Gly Thr Pro Arg Leu Lys i 
420 ' 425 430, 

Thr Glu His Asn He Thr Val Leu Val Ser Asp Val Asn .Asn Asn Ala 
•: 435 440 • 445 

Pro Ala* Phe Thr Gin Thr Ser Tyr Thr Leu Phe Val Arg Glu Asn Asn 
450 .455 ■ 460 

Ser Pro Ala Leu His lie Gly Ser Val Ser Ala Thr Asp Arg\sp Ser 
** 65 470 475 480 

Gly Thr Asn Ala Gin Val Thr. Tyr Ser Leu Leu Pro Pro Gin Asp Pro 
485 490 495 

His Leu Pro Leu Ala Ser Leu Val Ser lie Asn Ala Asp Asn Gly His . 

500 •'• ' . 505 • 510 

Leu Phe Ala Leu Arg Ser Leu Asp Tyr Glu Ala Leu Gin Ala Phe Glu 
h . 515 520 ' . 525 

Phe Axg Val Gly Ala Thr Asp Ary Giy Ser Pro Ala Leu Asn Ser Glu 
* 30 - 535 540 " 

Ala Leu Gly Ala Arg Ala Gly Ala Gly Arg Gin Arg Gin Leu Ala Leu 
545 . 550 55S 560 

Arg Ala Val Pro Ala Ala Glu Arq Leu Arg Ala Leu His Arg Ala Gly 
565 570 575 

Ala Pro Gly Gly Arg Ala Gly Leu Pro Gly Asp Gin Gly Gly Gly Gly 
580 585 '590 

Gly Arg Arg Leu Gly Pro Glu Arg Leu Ala Val Val Pro Ala Ala Gin 
,. 595 600 -605 

Gly His Gly Ala Arg Ala yal Arg Cys Val Gly Ala Gin Trp Gly Gly 



610 



615 620 



Ala His Arg Gin Ala Ala Glu Arg Ala Arg Arg Ser Gin Ala Gin Ala 
625 - 630 635 640 

Gly Gly Ala Cys Gin Gly Gin Trp Arg Ala Ser Ser Leu Gly His Arg 
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■ ' 1,9 . ' ' :■■ ■ ■ , 

' 64S ' ' 650 • 655. 

His Ala Ala Arg Ala Pro Gly Gly Arg Leu Leu Pro Ala Leu Pro Ala 
660 565 ,67 0 ' 

Ser Pro Gly Gly Gly Pro Gly Pro Gly Pro Gly Arg Leu Ala His Arg ; * * ' 

675 680 685 ' ■ 

Leu Pro Gly Cly Gly Val Gly Leu Gly Val Phe Aid Leu' Pro Pro Leu 
• 690 . 695 700, : ' . . . 

Gly Ala Pro Val- Arg Gly Gly Ala Ala Val Gin Glu Glu Gin Gly Gly 
705 ' 710 ■ " 715. 720 

Leu Gly Gly Ser Leu Leu Gly Ala Arg Gly Ser Phe Ser Arg Ala Ser 

725 • - 730 • 735 ■ 

Gly Gly Arg Glu Gly Arg Xaa \ . 
740 . 



<210> 166 

t <211> 214 '• t • 

/ <212> PRT , ■ • ' 

<213> Homo sapiens ' 

<220> ' 

<221> SITE 

<222> (214) 

<223> Xaa equals scop translation 

. <400> 166 : , ' 

Met Asn Arg Met Glu Leu Leu Lys Leu Leu Leu Thr Cys Phe Ser Glu 
1 .5 10 • 15 

Ala , Met Tyr Leu Pro Pro Ala Pro Glu Ser Gly Ser Thr Asn Pro Trp 
20 . 25 " 30 

Val Gin Phe Phe Cys Ser thr Glu Asn Arg His Ala Leu Pro Leu Phe 
35 ,• , 40 ., . 45 

Thr Ser Leu Leu Asn Thr Val Cys Ala Tyr Asp Pro Val Gly Tyr -Gly 
50 • 55 60 

He Pro Tyr Ash His Leu Leu Phe Ser Asp Tyr Arg Glu Pro Leu Val 
65 70 • 75 i 80 

Glu/ Glu .Ala Ala, Gin Val Leu lie Val Thr Leu 'Asp His Asp Ser Ala 
85 ' J ' 90 .95 

Ser Ser Ala Ser Pro Thr Val Asp Gly Thr Thr Thr Gly Thr Ala Met 
100 105 ; 110 

Asp Asp Ala Asp Pro Pro Gly Pro Glu Asn Leu Phe Val Asn Tyr Leu 
115 120 * 125 

Ser Arg tie His Arg Glu Glu Asp Phe Gin Phe lie Leu Lys Gly He 
130 v- 135 140 



WO 00/06698 



120 



PCT/US99/17130 



Ala Arg Leu Leu Ser 'Asn Pro Leu Leu Gin Thr Tyr Leu Pro Asn Ser 
145 150" 155 160 

Thr Lys Lys Asp Pro Val Pro Pro Gly Ala Ala Ser Ser Leu Leu Glu 
165 • 170 . ,"\ 175 

Ala Leu Arg Leu Gin Gin Glu lie Pro Leu Leu Arg Ala Glu Glu Gin 
180 185 190 

Arg Arg Pro Arg His Pro Cys Pro His Pro Leu Leu Pro Gin Arg Cys 
195 '200 205 

Pro Gly Arg Ser Val Xaa . . * . • 

210 * . ■ 



<210> 167 

<211> 213 ' ' ■ * 

<212> PRT ■.. 
<213> Homo sapiens 

<220> 
<221> SITE 
<222> (213) 

<223> Xaa-equals stop translation * • ' 

<400> 161 

Met Pro ^ Ser Leu Arg Phe Leu Ala Leu Ala Leu Leu Leu Ala He Leu 

1 , ' . , . 5 . , 10 15 

Pro Ala Leu Pro Asn Ala His Ala Ala Pro Gly lie Gly Gly Leu lie 
20 25 30 

Gly„Gly Gly Ser Gin Ala Ser Ala Lys Glu Glu Pro Gin Ser Asn' Ala 
35 , 40 45. r * , " 

Glh' Pro Ser Ala Asp Glu Arg Lys Gin Arg Leu Leu Ser Gin Ala Glu . 
50 , 55 60 " 

Glu. Thr Arg Gin Arg Leu Thr Asp Leu Lys Ala- Glu Leu Ala Gly Ala 
65 70' 75 80 

Pro Lys Glu lie Ser Glu Ala Gin Afg Thr Leu Ser Lys Leu Val Ser 
85. - 90 95 

Glu Asp Asn Ser Asp Leu 1 ?ro Glu Arg Leu Ser Lys Leu Ser Val Pro 
100 105 110 

Val Leu Glu Gin Arg Leu Ala Ala* Arg Val Asp Glu Leu Ala Leu Trp 
. 115 120 125 

Gin Gin Ala Leu Ser Ala Ala Asn Ser Met Leu He Ser Ala Gin Thr 
130 * 135 . . 140 

Arg Pro Glu Arg Ala Gin Ala Asp He Ser Lys Asn Gin Leu Arg* lie 
145 > 150 . 15S , ISO 

Asp Glu lie Asn Gly Leu Leu Lys Ser Gly Arg Glu Asn 'Asn Lys Pro 
165 170 175 
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leu Thr Asp Glu Arg Arg Ala Leu Leu Glu Ser Thr Ser Arg Ala Ala' 
180 185 190 

Ala Gly Pro Ser lie Phe Kis Pro Gly Gly Val Pro Gly Lys Cys Thr 
195 ' 200 205 

Gin Phe Ala Leu Xaa 

.210 . 



<210> 168, 
<211> 75 
<212> PRT 

<213> Homo sapiens 

' <22G>* 

<221> SITE 

<222> (75) , . 
' <223> Xaa equals stop translation 

<40C>- 168 

Met Phe Thr Scr Phe Gly Leu Ala Ser Pro Arg Tie Leu Phe Cys Phe 
1 5 10 .15 

Cys Phe Phe Asp Leu Gly Phe He Phe Phe Cys Val Leu Tyr Tyr lie 
20 25 30 

Val Lys Gly He Leu Ala Glu Thr Leu val Phe Gly Ala Arg Gly Glu 
35 40 ' 45 

Gin Glu Cys Trp Ala Val Tyr Phe Arg Trp Arg Thr His Leu Gin Thr 
50 55 60 

Phe Gly Leu Phe Ser Phe Asn Cys Ser Val Xaa 
65 70 75 



<210> 169 

<211> 48 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> <4B) 

<223> Xaa equals stop translation 
<400? 169 

Met Phe Leu Cys Leu Phe Phe Phe Phe Phe* Asn Ala Thr Gin Gly Asn 
1 5 10 ,15 

lie .Phe He Ser Phe Leu Ser Gly Leu Pro Gin Cys He Phe He Ser* 
20 25 . • 30 

Phe Glu Thr I.ys Arg Phe Trp Lys Leu Phe Phe Cys Ser Phe Lys Xaa 
35 40 45 
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<210> 170 
<2L1> B8 
<212> PRT .. 
<213> Homo sapiens 

<220> ' ' •• " . * ... 

<221> SITE , , 
<222> (881 ' 
<223> Xaa equals stop translation * 

<400> 170 . , , ■ . ■ 

, Met Gly Leu His Leu Arg Pro Tyr Arg Val Gly Leu Leu Pro Asp Gly 
1 , 5 .• 10 15 

Leu Leu Phe Leu Leu Leu Leu Lou Met Leu Leu Ala. Asp Pro Ala Leu 
20 25 .30 

Pro Ala Gly Arg His Pro Pro' Val Val Leu Val Pro Gly Asp Leu Gly 
■35 '.' 40 ,45 

Asn Gin Leu Glu Ala Lys Leu Asp Lys Pro Thr Val. Val His Tyr Leu 
50 . • 55 , 60 

Cys Ser Lys Lys Thr Glu Scr Pyr phc Thr He Trp Leu Asn Leu Glu 

65 70 75 '■' 90 

Leu Leu Leu Pro Val His His Xaa 
85 



1 <210> 171 . ' 

<211> 42 • .. 

<212> PRT '' 
<213> Homo sapiens 

<220> *' . -' .- 

<221> SITE ■ 

<222> (42) ; , 

<223> Xaa equals stop translation 

<400> 171 v • ■ < 

Met Ala Cys Glu Thr His Gly Val Leu Val Pro Ala His Leu Ser ciy 
• 1 , 5 10 15. 

Leu He Thr Cys Leu Leu Ala Phe Trp Val Pro Ala ser Cys lie Gin 



20 



25 



30 



Arg Cys Ser 
35 



Gly Scr Pro Leu Pro Leu Xaa 
40 



<210> 172 

<211> 48 

<212> PRT ** 

<213> Homo sapiens 
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<220> 

<22l> SITE 

<222> (48) . ' 

;<22 3> Xaa equals stop translation 

<400> 172 

Met Gin Cys Phe Leu Phe Ser lie Phe Leu He Thr Gly Leu Ala Glu 
1 -'• 5 > 10 • ,15 

Glu- Phe Cys Glu Glh Leu Ser lie Ser Leu Ala Glu Glu Glu lie Gin 
• 20 • ' 25 30 

' Leu Ser Ser ThrVal Glu His Phe Cys Met Thr Ala Phe Ser Trp Xaa 
35 40 /* '45' 



<210> 173 

<211> 233 , , 

<212> PRT 

<213> Hoitio sapiens 

<220> • • 

<221> SITE 

<222>.(233> 

<223> Xaa equals stop translation 
<400> 173 ' 

Met Ala, Ala Leu Ala Ala Ala Ala Lys Lys Val Trp Ser Ala Arg Arg 
1 5 ' ' ' 10 15 

Leu Leu Val Leu Leu Phe Thr Pro Leu Ala Leu Leu Pro Val Val Phe 
20 25 30 

Ala Leu Pro Pro Lys Glu Gly Arg Cys Leu Phe Val He Leu Leu' Met 
35 40 45 

Ala Val Tyr Trp Cys Thr Glu Ala Leu Pro Leu Ser Val Thr Ala Leu 
.50 55 60 

Leu Pro He Val Leu Phe Pro Phe Met Gly He Leu Pro Ser Asn Lys . 
65 , 70 . 75 80 

Val Cys Pro Glh Tyr Phe Lpu Asp Thr Asn Phe Leu Phe Leu Ser Cly 
.85 90 95 

Leu He Met Ala Ser Ala He Glu Glu Trp Asn Leu Kis Arq Arg He 
100 105 no ■ 

Ala Leu Lys He Leu Met Leu Val Gly Val Gin Pro Ala Arg Leu lie 

120 125 

Leu Gly Met Ket Val Thr Thr Ser Phe Leu Ser Met Trp Leu Ser Asn 
* 130 135 140 

Thr Ala Ser Thr Ala Met Met Leu Pro lie Ala Asn Ala He Leu Lys 
14 5 • • 150 * 155 ■ 160 
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Ser Leu Phe' Gly Gin Lys Glu Val Arg Lys Asp Pro Ser Gin Glu Ser ' 
165 170 175 

Glu Glu Asn Thr Gly lie Glu Pro Asn Thr Phe Leu Ser Glu GI.u Axg 
180 ' , ' 185 190 

Leu Lys Leu Gin Ala Pro Leu Val- He Arg Leu Gly Glh lie Thr Glu 
195 200 20S 

Ser Gly Gin Trp Asn Met Ser Gly Asn Asp Val Cys Asn Phe Arg Val 
• 210 215 220 

Leu Ser Phe Leu Pro Gly Gly Met Xaa* ' 
225 230 



<210> 174 
<211> 45 
<212> PRT 

*<213> Homo sapiens 
<220> 

<221> SITE 

<222> (45) '• 

<=223> Xaa equals stop translation 

<400> 174 * ' 

.Met Gly. Thr lie Phe Gly Tyr Leu His Cys Val Lys Cys Tyr Val Leu 
•'' 1 5 ^ . 10 15 

Tyr Phe He Phe lie Leu He Thr Ala Val Tyr His Ser Phe Tyr Tyr 

20 25 , 30 . 

Pro His Tyr Arg Gly Lys Ala Leu He Ser Gly Thr Xaa 
3 5 40 45 



<210> 175 ! 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (77) ^ 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> . 
<221> SITE 
<222> (85) 

<223> Xaa equals stop translation 
<40O> 175 

Met Val Trp Phe Leu Phe Leu .Val Phe He Phe Leu Lys Val Lys Gly 

1 5 /10 15 



Asp Phe Phe Pro Pro Phe Leu lie Cys Asn Leu Phe Cys He Trp Met 
20 25 30 
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lie Thr Gly Val Ser His Arg Leu Gin 'Pro Glri lie Leu Phe Ser Arg 
35 ! 40 45 

His, Lys His Asn. Gin Glu lie lie Leu Gin Met Val Ser Phe Ser Cys 
50 ■ 55 • 60 

Cvs Val Phc Phe Pro Met lie Arg Glu Val Lys Ser Xaa Leu Gly Cys 
65 .70 75 80 

He Lys Met Ser Xaa 

85 ' 



<210> 176 

<211> 66 • 

<212> PRT * 

<213> Homo sapiens ,, . * , - ' 1 

<220> 

<221> SITE ' . 
<222> (66) ' 

<223> Xaa equals stop translation ' 
<400> 176 " 

Met Trp Val Leu Leu Ser Cys Pro Leu Pro Pro Leu Cys Leu Pro Ala 

' • 1 . 5 10 N 15 

Ser Ala Val Pro Gly Gin Cys Leu Gly Gly Gin Trp Ser Gly His Gin 

20 • ■ . '25 30 

Leu Arg Leu Arg Gly Arg Gly Trp His Cys Arg Cys His Cys Arg Ala 
. 35 . 40 45 • 

Trp Ala Ala Asp Met . Gly Arg Gly Leu His Ser Cys Gin Leu Leu Ser 
50 ' 55 < .60 

Arg Xaa • 
65 . 



<210> 177 ' 
<211> 55 
<212> PRT 

<213> Homo sapiens ' 
<220> 

<221> SITE 

<222> (55) ' ' 

<223> Xaa equals stop translation' • 

<4O0> 177 ' 

Met Leu Leu Leu Cys lie Leu Leu lie Phe Cys Val Val Gly Leu Ser 

1 ^ 5 . 10 15 

Val Val fUy Arg Arg Val Leu Lys Ser Thr Thr He He Val Tyr Leu 
20 25 30 

Ser He Thr Pro Phe Ser Ser Phe Ser Ser, He Ser His He Phe Gin • 
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35 

^eu Leu lie Gly Ala His Xaa 
50 55 



<210> 178 - 
<211> 83 . 
<212> PRT 

<213> Homo sapiens "' ' ' ■ ' 1 ' ■ . 

<220> ■ . . ■ . 

,<221> SITE . 
<222> (4) ' ' ' 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> , • . y • 
<221> SITE 

<222> (83 J '• 
<223> Xaa equals stop translation 

<400> 178 ' 
-Met Cys Val Xaa Leu Ser Phe Cys Pro Phe Leu' Ser Ser Ala Leu. Pro 
• 1 5 10 15 

Ala Ser His Thr, Gin Phe Tyr Mec Pro Arg Gly Ala Lys Phe Gly Thr 
• 20 25 30 

Phe Thr Leu Gin Ala Ser Val Ser Pro Leu Glu Glu Lys Thr His Ser 

35 . '' . 40 45 : 

Phe Thr His Pro Gly lie Gly Gly Lys Leu Leu Gly Kis Gin Asp Pro 
50 55 60 . 

Gly Ala Pro Gly Pro Ser Trp Asn He Arq Ser Thr Trp Ser Thr Arg 
65 70 75 80. 

Ser Leu Xaa ' 



<210> 179 . 

<211> 330 , ... 

<212> PRT , . ' . 

<213> Homo sapiens - ' 

<220> 

<221> SITE 

<222> (38) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

,<22l> SITE 

<222> (247) > , ' . 

<223> Xaa equals any of the naturally occurring t" rani.no acids 

<400>" 179 ' > 

Met Ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr Ala Val 
1 5 10 15 
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Gly Ala Ala Val Tie Leu Ala Gin Leu Lsu Arg Arg Cys Arg Gly Gly' 
20 25 30 

Phe .Leu Glu Pro Val Xaa Pro Pro Arg Pro Asp Arg Val Ala, lie Val 
. 35 ' ' 40 , 45' ■ , 

Thr Gly Gly Thr Asp 'Gly He Gly Tyr Ser Thr Ala Lys His Leu Ala 
50 55 60 

Arg Leu Gly Met His Val lie Tie Ala Gly Asti Asn Asp Ser ; Lys Ala 
65 . * 70 75 80 

Lys Gin Val Val Ser Lys Tie Lys Glu. Glu Thr Leu Asn Asp Lys* Val . 

85 ' 90 . , 95 

Glu Phe Leu Tyr Cys Asp Leu Ala Ser Met Thr Ser He Arg Gin Phe 
100 10 5 110 • 

Val Gin Lys Phe Lys Met 'Lys Lys He Pro Leu His Val Leu Tie Asn 

115 . 12 0 ' 125 • ]■ 

Asn Ala Gly Val Met Met Val Pro Gin', Arg Lys Thr Arg Asp Gly Phe 
130 135 140 

Glu Glu His Phe Gly Leu Asn Tyr Leu Gly His Phe Leu Leu Thr Asn 
145 • 150. 155 160 

Leu Leu Leu Asp Thr Leu Lys Glu Ser Gly Ser Pro Gly His Ser Ala 
165 170 . '175 

Arg Val Val Thr Val Ser Ser Ala Thr His Tyr Val Ala Glu Leu Asn 

180 ' * * 185 - • 190 * 

Met Asp Asp Leu Gin Ser Ser Ala Cys Tyr Ser Pro His Ala Ala Tyr 
195 " 200 ; 205 

Ala Glh Ser Lys Leu AiLd Leu Val Leu Phe- Thr Tyr His Leu Gin Arg 
, 210 215 220 • • • ■ 

Leu Leu Ala Ala Glu Gly Ser His Val Thr Ala Asn Val Val Asp Pro 
225 230 , 235 240 

Gly val val Asn Thr' Asp Xaa Tyr Lys His Val Phe Trp Ala Thr Arg 
245 250 255 

Leu Ala Lys Lys Leu ueu Gly Trp Leu Leu Phe Lys Thr Pro Asp Glu 
260' - 265 270 

Gly Ala Trp Thr Ser He Tyr Ala Ala Val Thr Pro Glu Leu Glu Gly 
275 280 , 285 

Val Gly Gly Arg Tyr Leu Tyr Ash Glu Lys Glu Thr Lys Ser Leu His - 
290 295 300 

Val Thr Tyr Asn Glh Lys Leu Gin Gin Gin Leu Trp Ser Lys Ser Cys 
305 310 315 , 320- 

Glu Met Thr Gly Val. Leu Asp Val Thr Leu , 
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325 ,330 



<210* 180 
<211> 41 

■ <212> PRT ' - ' • ' 

<213> Homo sapiens ( 

<220> 

<221> SITE 
<222> {41) 

<223> Xaa equals stop translation 
<400> 180 

Met lie Ala Cys Gin Tyr lie Ser Leu Ala lie Met Leu Ala Phe Val 
1. 5 " 10 IS , 

Arg Trp Ala Ala Phe Leu Leu Phe Pro Phe Leu Cys Gly Asp Asn Gly ' 
20 ; 25 30 

Gly Asn lie Gin Gin Lys Tyr. Val Xaa' 
35 ' 40 



<210> 181 ■ • , ... 

<211> 52 

. <212> PRT *.. ' •. 

<213> Homo sapiens , / 

<220> . . . ' . . 

<221> SITE 
<222> (52) 

<223> Xaa equals stop translation 

'<400> 181 " '•• . . ' • • •' 

Met Ala Asn Ala Met Ala Tyr Leu Ser lie Phe Leu Cys Gly Ala, Ser 
1. 5 10 15 

' Ser Ser Pro Cys Asp Cys Ala Leu Leu val Pro" Val Scr Leu Phe Arg * 
20 1 25 30 

Gly Arg Lys Val Ala Asn, Phe Lys Asn Gin Asn Ser Asp Val Thr Ser 
35, 40 45 : 

Gly Asn Ala Xaa 
. 50 



<210> 182 
<2U> 55' 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (55) - 
<223> Xaa equals stop s translation 

<400> 182 
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Met Gin Gin lie Cys Ser Cys Leu Gly Ala Phe Ala Leu Leu PhP Phe 

1 5 10 ' is - 

' Trp Pro Gly His. Phe Thr Ser- Thr Phe Ser He Phe 'iyr Asp Phe Leu 
- 20 25 30 

Pro He Phe Gly Ser Leu Phe Lys Cys His Pro. Ser Lys Arg Pro Ser 
35 40 45 

Lys Leu Pro Tyr Leu Lys Xaa .* ' ' 

50'- 55 



<210> 183 . . 

<211> 62 

<212> PRT " * * * 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals stop translation 
<400> 183 

Met Ar'g.Leu Leu Leu Glu Trp Arg Val Tyr Leu Arg Leu Thr Cys Ala 
1 5 10- 15 

Thr Lys Asp Gly Met Ala Arg Glu Cys Pro Thx Thr Trp Leu Ser Pro 
20 25 30 

Pro Ala Lys Pro Asp Phe Ala 01 a Arg His Ser Val Lys Pro Thr Ala 
35 40 45 

Leu Gin Gly Gly Arg Trp ser Arg Leu Gly Ala Ser Pro Xaa 

. 50 ' 55 60 



<210> 184 ^ ' ' 

<211> 148 * 
<212> PRT 

<213> Homo sapiens 

<220> ' ' ' 

<221> SITE 
<222> (148) 

<223> Xaa eouals stop translation 
<400> 184 

Met Leu Gly Leu Pro Trp Lys Gly Gly Leu Ser Trp Ala Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Gly Ser Gin He Leu Leu lie Tyr Ala Trp His Phe His 
20 25 -30 

Glu Gin Arg Asp Cys Asp Glu His Asn Val Met Ala Arg Tyr Leu Pro 
35 40 45 

Ala Thr Val Glu Phe Ala val His Thr Phe Asa Gin Gin Ser Lys Asp 
50 55 60 
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Tyr Tyr Ala Tyr Arg Leu Gly His .He Leu Asn Ser Trp Lys Glu Gin 
65 J • 70 75 80 

Val Glu Ser Lys Thr Val Phe Ser Met Glu Leu Leu Leu Gly Arg Thr 
85 90 95 

Arg Cys.Gly Lye Phe Glu Asp Asp lie Asp Asn Cys His Phe Gin Glu 
100 , ; . 105 110 

Ser Thr Glu Leu Asn Asn Thr Phe Thr Cys .Phe Phe Thr Tie Ser Thr 
115 120 125 

Arg Pro Trp Met Thr Gin Phe Ser Leu Leu Asn Lys Thr Cys Leu Glu 
: 130 135 140 

Gly 1 Phe His Xaa , 
145 



<210> 135^ ' " ' ' ■ ' 

<211> 161 ' 
<212> PRT 

<213> Homo sapiens 

<220> ' 
<221> SITE ' 
<222> • (146) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (151) 

<223> Xaa equals, any of the naturally occurring L- amino acids 
<220> 

<221> SITE 

<222> (161) . . . 

<223> Xaei equals stop translation 
<400> 185 " , 

Met Arg Leu Leu Cys Gly Leu Trp Leu Trp Leu Ser Leu* Leu Lys Val 
1 ' 5 '10 15 

Leu Gin Ala -Gin Thr Pro Thr Pro Leu Pro Leu Pro Pro Pro Met Gin 
20 25 ' 30 

Ser Pha Gin Gly Asn Gin Phe Gin Gly Glu Trp Phe Val Leu Gly Leu 
35 ' ' 40 45 

Ala Gly Asn Ser Phe Arg Pro Glu His Arg Ala Leu Leu Asn Ala Phe 
50 55 60 

Thr Ala Thr Phe Glu Leu Ser Asp Asp Gly Arg Phe Glu Val Trp Asr.' 
65 70 , 75 ■ 80 

Ala Met Thr Arg Gly Gin His Cys Asp Thr Trp Ser Tyr Val Leu lie 
85 '90 I ' 95 



0 
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Pro Ala Ala Gin Pro Gly Gin Phe Thr Val Asp His. Gly Val ' Gly Arg 

. . 100 105 - 110 • • 

Eer<Trp Leu Leu Pro Pro Gly Thr Leu Asp Glti Phe lie Cys Leu Gly 

115, ' ,120 125 . • 

Arg Ala' Gin Gly Leu Ser Asp Asp Asn lie Val Phe Pro Asp Val Thr 

130 135 . , 140. 

Gly Xaa Ala Leu Asp Leu Xaa Ser Leu Pro Trp Val Ala Ala Pro Ala 

145 : 150 155 160 



<210> 186 -■ ■ 

<211> 122 ' \ ] ' ' 

<212> PRT . , . - • 

<213> Homo sapiens .'. , ' 

<22 0> , ' . 

<221> 'SITE ' . '.' "*' 

<222> (122) 

<223> Xaa equals stop translation V 
<400> 186 ' ' • . 

Met Met Leu Pro Gin Trp Leu Leu Leu Leu Phe Leu Leu Phe Phe Phe 

1 ' 5 ,'i0 '15 

Leu Phe Leu- Leu Thr Arg Gly Ser Leu Ser Pro Thr Lys Tyr Asn Leu 
20, 25 30 

Leu Glu Leu Lys Glu Ser Cys lie Arg Asn Gin Asp Cys Glu Thr -Gly 
35 40 45 ' 

Cys Cys Gin Arg Ala Pro ,Asp Asn Cys Glu Ser His Cys Ala Glu Lys ! 
50 . 55 ' , 60 . 

Gly Ser Glu Gly Ser Leu Cys' Gin Thr Gin Val Phe Phe Gly Gin jryr 
65 -70 75. 80 

Arg Ala Cys Pro Cys Leu Arg Asri Leu Thr Cys lie fyr Ser Lys Asn 
85 90 - 95 

Glu Lys Trp Leu Ser lie Ala Tyr Gly Arg Cys Gin Lys lie Gly Arg 
100 105 110 

Gin Lys Leu Ala Lys Lys Met Phe Phe xaa 
115 120 



<210> 187" 
<211> 163 
<212> PRT 
<213> Homo sapiens 

<220> 

<22l> SITE 
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<222> (163) 

<223> Xaa equals scop translation . 
<400> 187 . 

Met Thr Ser Asn.Phe Pro Phe Cys Thr Leu He Leu Gly lie Ala Gin 
1 ' 5 10 15 

Ala Clri Ala Cya Pro Gly Cys Pro Gly Asp Trp Pro Gly Leu Gly Ser 
20 25 < 30 

Gly Val Gly Glu Gly Leu His His Iie Arg, Thr Cys Arg Thr Pro He 
, 3.5 40 t 45 ' 

Pro Cys Ser Pro Pro Ala Pro Ala. Ala Ala Cys Leu Gly Ser Gly His *' 
.50 55 60 

Ala Arg Leu' Pro Cys Val Leu Arg Leu Trp Pro Val Pro Ala Asn Leu 
65 70 75 . 80 

Ser Ser Pro Phe Arg Leu Glu Ala Leu His Cys , Ser Phe ,Trp Ser Ser 

' , , 85 ' , 90 • 95 • 

Pro Leu Leu -Pro Ala Pro His Leu Ala Phe Phe Gly Phe - Arg Asp Leu 
" 100 105 110 

Leu, Thr Asp Phe Leu Leu Ala Ala Cys Leu Leu Thr Phe Gin Lys Thr 
115 120 12S 

Pro Leu Glu beu, Pro Met Ala Val Val His Leu Leu Val Ala Thr Pro 
130 , 135 , . 140 

Cys Tyr Gin Het Leu Asp Asn Leu Pro Leu Pro Ser Ala Ala Ala Asn 
145' 150 . 155 ■ 160 

Trp Cys Xaa 



<210> 188 ' ..*..- 
<211> 51 

<212> PRT , • ' 

<213> Homo sapiens 

<220> ' ' ; ' , 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 188 , 

Met Pro Gly He Leu Ala Gly He Pro Val Lys Asp Leu Cys Leu Ser 
1 ' 5 10 15 

Leu Leu Gin' Gly phe Arg Leu Leu" Leu Leu Cys Val Cvs Pro Gly Trp 
20 .25 * 30 

Leu Ser Gly' Trp Met Gly Gly Gin Lys Gly Ser Pro Arg He Val Asp 

< ■ 35 , ' 4C. ; • 45- ■ ♦ 

He Gly Xaa • 



WO 00/06698 



133 



PCIYUS99/17130 



50 



<210> 189 
<211> 65 , 
<212> PFT 

<213> Hof.o sapiens 

<220> ' 

<221> SITE 

<222> (65) 

<223> Xaa equals stop translation 



Tyr Leu .Gly, val Phe, Phe His Leu lie Tyr Pro Gly Ala 
5- , 10 15 

Leu Ser He Thr Thr Leu Gly <Lys His Ser His Pro Phe Phe Thr Ala 
20 2b ' - 30 ' 

Glu Gin Asn Ser Thr Val Trp Met Glu His Thr Leu Phe His Gin Ser 

3 5. • , 40 45 ' .'' 

Pro Val Ala Ser His Leu Val Cys Phe Gin Ser- Phe Ala Phe Ser Glu 

50 - . 55 60 

Xaa 

65 ■■ . - 



<210> 190 

<211> 47 ■ ■ - 

<212> PRT 

<213> Homo sapiens 

<220> • / • • • • • . 

«221> SITS 

<222> (47) ■ , ' ' 

<223* Xaa equals s^op translation ■ ■ , • 

<400> 190 '■ , 

Met Thr Leu Ser Leu Gin Leu Ala Glu Leu Val His Phe .Val Cys Ala 
1 ' 5 , , ' 1 10 IS 

Phe Gin Ser Gin. Trp Thr Gly Val Tyr Pro Met Met Pro Pro Leu Lys 
20 : 25 " 30 

Pro Thr Glu Pro Leu 
35 



<210> 191 
<211> 144 
<212> PRT 
• <213> Homo sapiens 

<220> 

<221> SITE 
<222> ( 144) 



<400> 169 
Met Tyr Leu 
1 



Cys Phe Ala Cys Val Pro ,Cys Arg Val Xaa 
40 • 45 
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<22 3> Xaa equals stop translation 
<400> 191 ' t ' ■ ■ 

Met Ser Pro Phe His Leu Leu Gly Leu Lys Val Phe Le\i Thr Trp Ala 
1 5 ' , 1C ' • • • . 15 

Leu Thr Leu Ala Gin lie Cys Leu Tyr Phe Phe Glu' Val Gin Pro Leu 
20 25 30 

Gly Leu Leu Ala ' Leu Asn Phe Phe Cys Thr Ala Thr. Ala Gly Leu Lys' 
35 40 45 

Glu Lau Cys Met His Pro Pro Ser Leu Ala Phe Thr Pro Glu Phe His 

50 . , , 55 60 / 

Thr Ser Leu Ser Pro Leu Ala He Pro Ser Phe Cys Gly Thr Ser Val 
65 70 75 " * \ 80 

Ser Leu Ser Asn Ser His Thr lie Pro Leu Ser Leu, Tyr , Leu * Pro Phe 

85 ' .90 . ' ( ' 95 

Pro Ser Lys Ser Arg Met Pro Asp Thr Leu His Leu Leu Val Ills Ser 
100 105 110 

Leu Pro Leu Val His Ser Gin Val Leu Pro Val Lys Asp Val' Thr He 
115 120 125 

Glu Trp Pro Leu Cys Gin Arg Cys Leu Gly Ser Thr Cys His Glh Xaa 
130 * 135 140 



<210> 192 

<211> 81 

<212> PRT . . ' 

<213> Homo sapiens ' 

<220> ■ ' ' • " 

<221> SITE 

<222> (76) 

<223> Xaa equals any ot the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> {81) . . 

<^223> Xaa equals any of ,the naturally occurring L*- amino acids 

<400> 192 , ' ■ ' ' ' . " • . 

Mat Phe Cys Phe Ser Ser He Phe Cys Ser His Glu His Thr His Leu 
1 5 . 10 ' 15 

Pro Gly Thr Phe Trp Leu Phe Leu Phe Leu Phe Leu He Leu Pro Pro 

' 20 2 5 . 30 



Ser Cys Pro Cys Phe Leu Pro Phe Ser Leu Ala He, Glu Thr Val Arg 
35 40 45 
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Trp Pro Cys Trp His His Pro Thr Ser Phe Glu Leu Cys Tyr Pro : Gly 
50 55 60 

Thr. Ser He Tyr Tyr Ala Ser Arg Gly Gly Pro. Xaa Pro Asn Ser Glu 
'65 - 70 ,75 80 

Xaa . 1 



<210>;193 
"<2li> 45- • " * 

• <212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE" 

, <222> (45) 

<223> Xad equals stop translation . . 

<400> 193 ■ 

Met Thr Tyr Leu Pho Cys Ser Ser lie Ser Leu Leu Leu Leu Lys Val 
1 ' 5 . 10 15 

His Ser Ser Gly His Gin Asp He Arg Lys Ala Lys Ser Lys Val Pro 
20 25 30 

. Arg Leu Leu He He Gin Cys Pro Gin Gin Arg Glu Xaa 
35 40 45 



<210> 194 

<211> 42 . ' 

<212> ?RT 

<213> Homo sapiens 
' <220> 

<221> SITE ** . 

*<222> (42) 

<223> Xaa equals stop translation 
<400> 194 

Met Pro Thr lie Trp Val Lys Leu Cys Leu Leu Gin Val Cys His Gly 
.1 5 • 10 15". 

Leu Phe Pro Leu Leu Lys His Trp Ser Gin .Pro Met Pro Leu Cys Val 
20 25 30 



Thr Leu Ala Pro val Ser Tyr. Trp Leu Xaa 
35 40 



<210>.195 ' 
<211> 260, 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
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• <222> (260) ' 
<22 3> Xaa equals stop translation ' 

<400> 195 

Met- Gly Thr Ala Ala Leu Gly Pro val Trp Ala Ala Leu Leu Leu Phe ■ 
1 5 1C 15 , 

Leu Leu Met Cys Glu Tie Pro Met Val Glu Leu Thr Phe Asp Arg Ala ' 
20 25 30. 

Val Ala Ser Asp Cys Gin Arg Cys Cys Asp Ser Glu Asp Pro Leu Asp 

35 '40 " 45- • 

Pro Ala His Val Ser Ser Ala Ser Ser. Ser Gly Arg Pro His Ala Leu 
50 b!> • 60 , 

Pro Glu He Arg Pro Tyr lie Asn lie Thr lie Leu Lys Gly Asp Lys 
65 .70 75 . 80 

Gly* Asp Pro Gly Pro Met Gly. Leu Pro Gly Tyr Met Gly Arg Glu Gly * 
85 90 , 95 

Pro Gin Gly Glu Pro Gly Pro Gin Gly Ser Lys Gly Asp Lys Gly Glu 
; . • 100 105 110 

Met Gly Ser Pro Gly Ala Pro Cys Gin Lys* Arg Phe Phe Ala Phe Ser 
115 ' 120 " 125 

' Val Gly Arg Lys Thr Ala Leu His Ser Gly Glu Asp Phe Gin Thr Leu 
130 . 135 \ , 140 

Leu Phe Glu Arg Val Phe Val Asn Leu Asp Gly Cys Phe Asp Met Ala 
"145 ISO 155 160 

Thr Gly Gin Phe Ala Ala Pro Leu Arg Gly He Tyr Phe Phe Ser Leu 
165 170 175 

Asn Val His Ser Trp Asn Tyr Lys Glu Thr Tyr Val His He Met His 
180 185 190 

Asn Gin Lys Glu Ala Val He Leu Tyr Ala Gin Pro Ser Glu Arg Ser 
195 200 ,205 

lie Met Gin Ser Gin Ser Val Met Leu Asp Leu Ala Tyr Gly Asp Arg 
210 215 _ 220 • 

Val Trp Val Arg Leu Phe Lys Arg Gin Arg Glu Asn Ala He Tyr Ser 
225 230 235 240 

Asn Asp Phe Asp Thr Tyr He Thr Phe Ser Gly His Leu He Lys Ala 
. *■ 245> 250 . 253 

Glu Asp Asp Xaa 

<210> 196 ' 
<211> 117 ' 
<212> PRT 
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<213> Homo sapiens • ' * "' 

. <400> 196 *" • ;• • ' j ' 

Met Leu Gly His Cys Cys Tyr Phe Trp Gin Val Trp Pro Ala Ser Glu 
1 " • .5 /■ \ 10 ; 15 

Ala Leu Ala Ala Gly ^Pro Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser 
20 , f 25 . ■ 30 " 

. Trp Lys Gin His lie. Gly. Thr Ser Leu Gin Lys Thr Arg Gly Ser: Leu " 
35 h 40 . t 45- ' 

Pro Thr t Thr Thr Leu Thr Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly 

50 1 " 55 60 ' ' 

Lys Asn Pro Ala Ala Gly 'Arg SerrLeu Glu Gly Ala Leu Pro Ala Gly' 
■ ■ 65 A 70 " 75 • 80 ' 

*Val Trp Pro cys Phe Ala ' Gin Ser Pro Cys Thr Gly Gly 'Gin Glr/?hr 

; , 85 90 • 95 + 

Pro Ser Ser Thr Gly Leu Arg Ser Cys' Leu Val Arg Ser Pro Ala Thr 

100 •.' >' 105 -110 ' \ •• , 

Trp Trp Arg Thr ? Pro v ' 

115 

. <210> 197 ' 

. <211> 698 . • . ' , 

<212> PRT • • t 

<213> Homo sapiens ' ' 

<400> 197 

Met Leu Pro Ala Arg Leu Pro Phe Arg Leu Leu Ser Leu Phe Leu Arg 
1 5 .10 15 

Gly. Ser Ala Pro Thr Ala Ala Arg His Gly Leu Arg Glu Pro Leu Leu"' 

20 • ; • 25- 30 • 

Glu Arg Arg Cys Ala Ala Ala Ser Ser Phe Gin His Ser Ser Ser Leu 

35 . .. ■ 40 45 ^ 

Gly Arg Glu Leu Pro Tyr Asp Pro Val Asp Thr Giu Gly Phe Gly Glu 
• 50 55 60 

Gly Gly Asp Met Gin Glu Arg Phe Leu Phe Pro Glu Tyr lie Leu Aso 

65 . . 70 .... 75 fi 80 

Pro Glu Pro Glh Pro Thr Arg Glu Lys' Gin Leu Gin Glu Leu Gin Gin . 

'85 ' v ■■ . 90 95 

Gin Gin Glu Glu Glu Glu Arg Gin Arg Gin Gin Arg Arg Glu Glu Arg 
*. 100 105 . ' 110 

Arg Gin Gin Asn Leu Arg Ala' Arg Scr Arc Glu His Pro Val Val Gly 

115 " - 120 * 125 " 

His Pro Asp Pro Ala Leu Pro- Pro Scr Gly Val Asn Cys Ser Gly Cys 
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130 135 140 

Gly Ala Glu Leu His Cys Gin Asp Ala Gly Val Pro Gly 'Tyr Leu Pro 
145 150 155 160 

Arg Glu Lys Phe Leu Arg Thr Ala Glu. Ala Asp Gly Gly Leu Ala Arg 
165 17C 175 •= 

Thr val Cys Gin Arg Cys Txrp Leu Leu Ser His His Arg Arg Ala Leu. 

180 * -185 190 . 

Arg Leu Gin Val Ser Arg Glu Gin Tyr Leu Glu Leu Val Ser Ala Ala 
195 ' . ' 200 205 . 

Leu Arg Arg Pro Gly Pro Ser Leu Val Leu Tyr Met Val 'Asp Leu Leu 
210 ■ 215 ■ . 220 1 

Asp Leu Pro Asp Ala ' Leu Leu Pro Asp Leu Pro Ala Leu Val Gly Pro 
225 * 230 235 \ 240 

Lys Gin Leu lie Val Leu Gly Asn Lys Val Asp Leu Leu Pro Gin Asp 
245 - 250 * 255 

Ala Pro Gly Tyr Arg Gin Arg Leu Arg Glu Arg Leu Trp Glu Asp Cys 
'. 260 265 : • ,270 

Ala Arg Ala Gly Leu Leu Leu Ala rro Gly His Gin Gly Pro Gin Arg 
275 280 285 

Pro Val Lys Asp Glu Pro Gin Asp Gly Glu Asn Pro Asn Pro Pro Asn 
290 295 300 

Trp Ser Arg Thr Val Val Arg Asp Val Arg Leu lie Ser Ala Lys Thr 
305 , 310 315 320 

Gly Tyr Gly Val Glu Glu Leu lie Ser Ala Leu Gin Arg - Ser Trp Arg 
325 330 335 

.Tyr Arg Gly Asp Vai Tyr Leu Val Gly Ala Thr Asn Ala Gly Lys Ser 
340 345 350 

Thr Leu Phe* Asn Thr Leu Leu. Glu Ser Asp Tyr Cys Thx Ala Lys Gly 
355 360 ' 365 

Ser Asp Ala He Asp Arg Ala Thr He Ser Pro Trp Pro Gly Thr Thr 
370 375 380 

Leu Asn Leu Leu Lys Phe Pro lie Cys Asn pro Thr Pro Tyr Arg Met 
385 390 395 : 400 

Phe T.ys Arg- His Gin Arg Leu Lys Lys Asp Ser Thr Glh Ala Glu Glu 
. 405 410 415 

'.■ 

Asp Leu Ser Glu Gin Glu Gin Asn Gin Leu Asn Val Leu Lys Lys His 
420 425 430 

Gly Tyr Val ■ Val Gly Arg Val Gly Arg Thr Phe Leu Tyr Ser Glu Glu 
435 440, 445 
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Gin Lys,. Asp Asn lie Pro Phe Glu Phe Asp Ala Asp Ser Leu Ala Phe 
450 . 455 460 

Asp Met Glu Asn Asp Pro Val Met Gly Thr His Lys Ser Thr Lys Gin 
. 4 <55 470 475 480 ' 

Val Glu Leu Thr Ala* Gin Asp Val Lys Asp Ala His Trp Phe Tyr Asp 
485 490 495 

Thr Pro Gly lie Thr Lys Glu Asn Cys He Leu Asn Leu Leu Thr Glu 
500 505 510 

Lys Glu Val Asn Ml e Val Leu Pro' Thr Gin Ser He Val Pro Arg Thr' 
> '515, • 520 525 ' 

)i. Phe Val 5 Leu Lys Pro Gly Mec ,Val*Leu Phe Leu*'Gly Ala lie Gly Arg 
530 535 540 

lie Asp Phe Lou 3 in Gly Asn Gin Ser Ala Trp Phe Thr Val Val Ala 

. > 545 ■■ 550 . ' 555v 5S0 : . 

Ser Asn He Leu Pro Val His He Thr Ser Leu'., Asp 'Arg Ala Asp Ala 
565 570 / 575 

Leu Tyr Gin Lys His Ala Gly His Thr Leu Leu Gin lie Pro Met Gly 
580 585 590 

Gly Lys Glu Arg Met Ala Gly Phe Pro Pro Leu Val Ala Glu Asp lie 
595 600 - 605 

'» . .. r ' ' ■ 

Met Leu Lys Glu Gly Leu Gly Ala Ser Glu Ala Val Ala Asp He Lys 
610- ' 515 -620 

Phe Ser Ser Ala Gly Trp Val Ser Val Thr Pro Asn Phe Lvs Asp Arg 
6 25 630 635 640, 

Leu His Leu Arg Gly Tyr Thr Pro Glu Gly Thr Val Leu Thr Val Arg 

; ' ' £45 • > 650 • 655 

Pro Pro Leu Leu Pro Tyr He Val Asn He Lys Gly Gin Arg He Lys 

660 665 . ■ ", 670 / 

..... 

Lys Ser Val Ala Tyr Lys Thr Lys Lys Pro Pro Ser . Leu Met Tyr' Asn 
675 680 6B5 

' . ./. , • • 

Val Arg Lys Lys Lys Gly r.ys He Asn Val 
690 , 695 



,<210> 198 

<211> 348 • 

<212> PRT. ' ] 

<213> Homo sapiens 

<400> 198 

Met Asn Met < Thr Gin Ala Arg Val Leu Val Ala Ala Val Val Gly Leu 
1 5 10' 15 

Val Ala Val Leu Leu Tyr Ala Ser He His Lys lie Glu Glu Gly His 
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20 . 25 t 30 

Leu Ala Val Tyr Tyr Arg Gly Gly Ala Leu Leu Thr Ser Pro Ser Gly 

. 35 40 45 . 

' Pro Gly Tyr His" lie Met Leu Pro Phe lie Thr Thr Phe Arg s'ef Val 
50 55 60 

Gin Thr Thr Leu Gin Thr Asp Glu Val. Lys Asn Val' Pro Cys Gly Thx 
65 70 75 80 

Ser Gly Gly Val Met lie Tyr. lie Asp Arg lie Glu Val Val Asn Met 

■ 85 90 95 ■ ■ 

Leu Ala Pro Tyr Ala Val Phe Asp lie Val Arg Asn Tyr Thr Ala Asp 
100 105- no 

Tyr Asp Lys Thr, Leu lie Phe Asn Lys lie His His Glu Leu Asn 31n 
115 120 _ . 125 

Phe Cys Ser Ala His Thr Leu Gin Glu Val ■ Tyr lie Glu Leu Phe Asp 
130 135 140 . 

' Gin He Asp Glu Asn Leu Lys Gin Ala Leu Gin Lys Asp Leu Asn Leu 
' 145 150 ,. . 155 • • ISO' 

: ' . ■ ' " i • ■ 

Met Ala Pro Gly. Leu Thr lie Glh Ala Val Arg Val Thr Lys Pro Lys 
165 . 170, ' . 175 ■ 

lie Pro Glu Ala lie Arg Arg Asn Phe Glu Leu Met Glu Ala Glu Lys 
ISO .195 • , 190 

Thr Lys Leu Leu rle Ala Ala Gin Lys Gin Lys Val Val Glu Lys Glu/ 
195 200 205 

Ala Glu Thr Glu Arg Lys Lys Ala Val' lie Glu Ala Glu Lys He Ala 

210 , ' '* r- > 215 • - . ■ '220 

Gin Val Ala Lys lie Arg Phe Glh Gin Lys Val Met, Glu Lys Glu Thr 
225 230 235 , : 240 ' 

Glu Lys Arg lie Ser Glu He Glu Asp Ala Ala Phe Leu Ala Arg Glu 
245 ' 250 255 

Lys Ala Lys Ala Asp Ala Glu Tyr Tyr Ala Ala His Lys Tvr Ala Thr 
260 265 270 

Scr Asn Lys His Lys Leu Thr Pro Glu Tyr Leu Glu Leu Lys Lys Tyr 
275 280 ' 285 

Gin Ala He Ala Ser Asn Sor Lys Tie Tyr Phe Gly Scr Asn lie Pro 
'293 ■ ■ ** 295 300 ' > 

Asn Met Phe Val Asp Ser Ser Cys Ala Leu Lys Tyr Ser Asp He Arg 
3 ,°5 310 315 ' 320 

Thr - Gly Arg Glu Ser Ser Leu Pro Ser Lys Glu Ala Leu Glu Pro Ser 
■325 330 . 335 
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Gly Glu Asn Val lie Gin Asn Lys Glu Ser Thr Gly 
3*0 ' 345 , 



<210>'199 , . 

<211> 401 ( ' ' ' 

<212> PRT ' 

<213> Homo sapiens 

<220> 

<221> SITE,, . 
<222> (307) ■ 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 199^ ' 

Met Met Gly Leu Gly Asn Gly Arg Arg Ser Met Lys'Ser Pro Pro Leu 
1 ■ 5 '10 .15 

Val Leu Ala Ala Leu Val Ala Cys lie. lie Val Leu Gly Phe Asn Tyr 
20 25 30 

Trp lie Ala Ser Ser Arg Ser Val Asp Leu Gin Thr Arg lie Met Glu 
35 40 45. 

Leu Glu Gly Arcf Val Arg Arg Arg Ala Ala Glu Arg Gly Ala' Val Glu 
■ 50 55 60 

Leu Lys Lys Asn Glu Phe . Gin Gly Glu Leu Glu Lys Gin' Arg Glu Gin 
65 70 75 ,80 

Leu Asp Lys lie .Gln.'Ser Ser His Asn Phe Gin Leu Glu Ser Val Asn' 
•85 . 90 95 

Lys Leu Tyr Gin' Asp Glu Lys Ala Val Leu Val Asn Asn lie Thr Thr 
100 ,105 110 

Gly Glu 1 Arg Leu He Arg Val Leu Gin Asp Gin Leu Lys Thr Leu Gin 
' 115 120 125 

Arg Asn Tyr Gly Arg Leu Gin Gin Asp Val Leu Gin Phe Gin Lys Asn 
130 135 140 

Gin Thr Asn Leu Glu Arg Lys Phe Ser Tyr Asp Leu Ser Gin Cys He 
145 150 155 160 

Asn Gin Mec Lys Glu Val Lys Glu Gin Cyc Glu Clu Arg He Glu Glu 
"'165 170 175 

Val Thr Lys Lys Gly Asn Glu Ala Val Ala- Ser Arg Asp Leu Ser Glu 
180 . 185 '< 190 

Asn Asn Asp Gin- Arg Gin Gin Leu Gin Ala Leu Ser Glu Pro Gin Pro 
195 . ' 200 205 

Arg Leu Gin Ala Ala Gly Leu Pro His Thr Glu Val Pro Gin Gly Lys 
210 215 220 

Gly Asn Val Leu Gly Asn Ser Lys Ser Gin Thr Pro Ala Pro Ser Ser 
225 * 230. 235 240 
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Glu Val val Leu Asp .Ser ,Lys Arg Gin Val Glu Lys Glu Glu Thr Asn 

• . ■ 245 • ; . ■ ' 2.50 .\ " "" 255 <■ • 

Glu lie Gin, Val Val Asn Glu Glu Pro Gin Arg Asp Arg Leu Pro Gin 

.. ; 250 265 270 * 

Glu Pro Gly Arg Glu Gin Val Val Glu Asp Arg Pro Val Gly Gly Arg 

275 '. ' - •' % • : 280 ' . 285 

! Gly Phe Gly Gly Ala Gly Glu Leu Gly Gin Thr Pro Gin Val Gin Ala' 

290 , , , . 295 . ' . - . -300 " ., 

Ala Leu Xaa Val Ser Gin' Glu Asn Pro Glu Met Glu Gly Pro Glu Arg 
305 J -310 ' . ' • ' 315 , 320 

Asp Gin Leu Val lie' Pro, Asp Gly Gin Glu Glu Glu> Gin Glu Ala Ala 

'.' 325 J V. ^ 330 , .335 

Gly Glu Gly Arg Asn Gin Gin Lys Leu Arg Gly Glu Asp Asp Tyr Asn 

340 > - • ' 34 5 ' ; ,' .350 

Met Asp Glu Asn Glu Ala Glu Ser Glu thr Asp Lys Gin Ala Ala Leu 
355 ' ., ,v ,360 ■ ,365, 

Ala Gly Asn Asp Arg Asn He Asp' Val Phe Asn Val Glu Asp Gin Lys 
370 , . \ • ' 375 380 

, Arg Asp Thr lie Asn Leu Leu, Asp Gin Arg Glu Lys Arb Asn His Thr ' 
385' 390 _ ^ 395 ■ ■ 400 

Leu '• , ' :.' 



<210> 200 
<211> 324 .. 
<212>" PRT 1 r 
<213> Homo sapiens 

<220> 

<221> SITE • 
<222> (3) 

< 2 23 > Xaa equals any of the 'naturally occurring L-amino acids' 

<400> 200 . • ; ' . 

Met Glu Xaa , Ala Lys Val Tyr Val Ala Lys Val Asp Cys Thr Ala His 

- / 5 ' ' ' 10 15 

Ser Asp Val Cys Ser Ala Gin- Gly val Arg Gly Tyr Pro Thr . Leu Lys 

- v . 20 25 3 0 

Leu Phe Lys Pro Gly Gin Glu Ala Val Lys Tyr Gin 'Gly Pro Arg- Asp > 
35 • ' ' ... 40 45.' . 

Phe Gin Thr Leu Clu Aon Trp Met Leu Gin Thr Leu Asn Glu Glu Pr o 

,/'. 50 ■ • • ' 55 it . , 60 • ,. ' 

Val Thr Pro Glu Pro Glu Val Glu Pro 1 Pro Ser 'Ala Pro Glu Leu' Lys * 
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Glri Gly Leu Tyr Glu Leu Ser Ala Ser Asn Phe Glu Leu His Val Ala 
85 ' . 90 95 

" Gin Gly Asp His Phe ^Ile Lys Phe Phe Ala Pro Trp'Cys Gly His Cys 
100 105 HO 

Lys Ala Leu Ala- .Pro -Thr Trp Glu Gin Leu Ala Leu Gly Leu Glu His 
115 ■ ■ ; •; 120 ' 125 

Ser Glu Thr Val Lys lie Gly Lys Val Asp Cys .Thr Gin His' Tyr Glu 
130 135 , 140, 

Leu Cys Ser Gly Asn Gin Val" Arg Gly Tyr Pro Thr Leu' Leu T-p Phe 
145 ■ * 150 155 ' 160 

Arg Asp Gly Lys Lys Val 'Asp Gin Tyr Lys Gly Lys Arg Asp' Leii Clu 
165 ' • 170 • 175 

Ser Leu Arg Glu Tyr Val Glu Ser Gin Leu Gin Arg Thr Glu Thr Gly 
180 . 185 - ... 190 

Ala Thr .Glu Thr Val Thr Pro Ser Glu Ala Pro Val Leu Ala Ala Glu 

1 95 . .-■ 200 : - 205 . 

Pro Glu Ala Asp Lys Gly Thr Val Leu Ala Leu Thr Glu Asn Asn Fhe 
" '210 215 220 

Asp Asp Thr lie Ala Glu Gly He Thr Phe lie Lys Phe Tyr Ala Pro " 
225 230 235 ' 240 ' 

Trp Cys Gly His Cys Lys Thr Leu Ala Pro Thr Trp Glu Glu Leu Ser 

j ■ . ' • ■•■ • 245 250 ■ 255 

Lys Lys Glu Phe Pro Gly, Leu Ala Gly Val Lys lie Ala Glu Val Asp 
' 260 .' / t ■ * .265 .270 

Cys Thr Ala Glu Arg Asn lie Cys Ser Lys Tyr Ser Val Arg Gly Tyr 
- . 275 . . , . 280 285 

Pro Thr LeuLeu Leu Phe Arg Gly Gly Lys Lys Val Ser Glu His Ser 
290 295 300 

Gly Gly Arg Asp Leu Asp Ser Leu His Arg Phe Val Leu ' Ser Gin Ala 
305 310 315 ■■ 320 

Lys Asp 1 . Glu Leu .1 



<210> 201 

. <211> 90 , , ' 

<212> PRT ' 

<213> Homo sapiens ... 

<400> 201 - * ■ ■■ , . 

Met Ala Leu Phe ser Cys .'Leu Leu Leu Leu Lys Gin Ser Asp Gly Ala 

" 1 '• 5 10 ; ■ - 15 • 
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Ser Pro Val Leu Arg Ala Leu Ala Ala Ser Cys Leu Ala Ser Pro A">a 
20 25 30 

Gly. Cys Cys Gly Thr Arg Lys Ala Leu Asn Gly Asn Val Gly Giu Lys 
35 40 ' 45 

Val' Gly Phe Thr Phe Mec Set Phe Gin Gly Cys Asp Pro Ser Ser Pro 
50 55 60 

Gly Cys Leu Cys Cys Ser Leu Leu Pro Ser Asn Ser Gin Leu Val Phe 
65 70 75 80 

lie Ser Phe Leu Val Leu Ser Gly Leu Ala . * . 

85 '90 



<210> 202 
<211> 243 
<212> PRT * 
<213> Homo - sapiens 

<400> 202 

, Met Arg Pro* Gin Gly Pro Ala Ala Ser Pro Gin Arg. Leu Arg Gly Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Gin Leu Pro Ala Pro Ser Ser Ala Ser Glu 
20 25 30 

lie Pro Lys Gly Lys Gin Lys Ala Gin Leu Arg Gin Arg Glu Val Val 
35 40 45 

Asp Leu Tyr Asn Gly Met Cys Leu Gin Gly Pro Ala Gly Val Pro Gly 
50 55 60 

Arg Asp Gly Ser Pro Gly Ala Asn Gly He Pro Gly Thr Pro Gly He 
65 70 75 80 

Pro Gly Arg/ Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys Leu Arg Glu 
85 90 95 

Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gin Cys Ser Trp Ser 
100 ios 110 

Ser Leu Asn Tyr Gly He Asp Leu Gly Lys lie Ala Glu Cys Thr Phe 
115 120 125 

Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe Ser Gly Ser 
.130 135 140 

Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Glu Arg Trp Tyr Phe Thr 
145 150 155 160 

Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro He Glu Ala lie lie 
165 170 175 

Tyr Leu Asp Gin Gly Ser Pro Glu Met Asn Ser Thr He Asn He His 
180 , 185 190 
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Arg Thr *Ser Ser Val Glu Gly Leu Cys Glu Gly lie Gly Ala Gly Leu 

19S 200 205' ' ' . 

Val Asp val Ala He ^rp Val Gly Thr Cys Ser Aso Tyr Pro Lys Gly 

210 ,215 * 220 • r t 

Asp Ala Ser -Thr Gly Trp Asn Ser Val Ser Arg lie lie lie Glu Glu 
225 230 235 240 



Leu Pro Lys 



<210>* 203 

<211> 75 . . ' 

<212> PRT " 
<213>. Homo, sapiens 

<400> 203 . . 

Met Ala Gly Gin Glu Asp Pro Val Gin Arg Glu lie His Gin Asp Trp 
, 1 5 • 10 -15 

Ala Asn Arg Glu Tyr lie Glu lie He Thr Ser Ser f Ile Lys Lys lie 
20 25 30 

Ala Asp Phe Leu Asn Ser Phe Asp Met Ser Cys Arg Ser Arg Leu Ala 
35 40 45 

Thr Leu Asn Glu Lys Leu Thr Ala Leu Glu Arc Arg He Glu Tyr He 
50 55 60 

Glu Ala Arg Val Thr Lys Gly Glu Thr Leu Thr 
65 70 75 , . 



<210> 204 . , . ' " ' ' . 

<211> 246 " ' 

<212> PRT : 

<213> Homo sapiens 

<220> 

<221> SITE ' ' ■ ' 

<222> (185) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 204 ■ 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie He Val Leu Pro Ser 
1 5 ' ib 15 

Asn Val He Asn Phe Ser Gin Ala Glu. Lys Pro Glu Pro"Thr Asn Gin 
20 25 30 

Gly' Gin Asp Ser Leu Lys Lys His Leu His Ala Glu He Lys Val lie 

35 -40 4 5 t~ 

Gly Thr lie Gin lie Leu Cys Gly Met Met- Val Leu Stir Leu Gly He 
5Q '55 60 . 

lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin VhZ Thr Ser 
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65 " . 70 75 ' . 8 o 

Thr Leu Leu Asn Ser Ala" Tyr Pro . Phe lie Gly Pro > Phe Phe Phe He 
85 9C 95 

lie Ser Gly Ser Leu Ser lie Ala Thr Glu r,ys Arg r,eu Thr Lys Leu 
• : ' 100 * ., 105 "\ . HO 

Leu val His Ser Ser Leu Val Gly Ser Tie Leu Ser Ala Leu Ser Ala 
115 120 -125 

Leu Val Gly Phe lie lie Leu Sor Val Lys Gin Ala Thr Leu Asn Pro 
130 . ' 135 . 140 '■ 

Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Ash He Pro Thr \rg Ser 
145 .150 155 . 160 

Tyr Val Ser Tyr ' Phe Tyr His Asp. Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 ' >170 175 

Thr Ala Lys Ala Ser Leu Ala Gly Xaa Leu Ser Leu Met Leu He Cys 
... 180 185 190 .. 

Thr, Leu Leu Glu Phe Cys Leu Ala Val Leu - Thr Ala Val Leu Arg Trp 
"195 200 • 205 

Lys Gin Ala 'Tyr Scr Asp Phe Pro Gly Ser... Val Leu Phe Leu Pro His 
210 215 220 

Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
22 5 230 . , 235 ■ 240 

Gly Tyr Glu Glu Leu Leu Thr Ser 

245 ' ' ' . 



<210> 2.05 

<211> 1GB ■ . ■■ 1 . \ 

<212> PRT 

<213> Homo sapiens . . 

<220> 1 

<221> SITE '' 
<222> (83) 

<223> Xaa equals any of the naturally occurring L-amino' acids 
<400> 205 

Met Pro L«u Leu Arg G~y Leu Leu Trp Leu Gin Val Leu Cys Ala Gly 

• x < ,5 10 • 15 

Pro Leu His Thr Glu Ala Val Val Leu Leu Val Pro Ser Asp Asp Gly 
20 ' 25 30 

Arg Ala Phe Leu Leu Arg Ser Arg Leu Leu His Pro Glu Ala His Val 
35 40 45 

Pro Pro Ala Ala' Asp Arg Gly Ala Ser Leu Gin Cys Val Leu His Gin 
50 55 60 
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•Ala Ala Pro Lys Ser Arg Pro Arg Ser Pro Ala Ala Gly Ala Ala Leu 
65 • 70 7S '80 

( leu- His Xaa Pro Arg Arg Thr Gly Asp Glu Pro Cys Arg Glu Phe His 
85 90 95 

Gly Asn Gly Phe' Pro Gly Pro . Thr Gin Leii Thr Pro Gly Glu Cys Gly 
100 , 105 110 

. Leu Pro Ala Pro Ser Ser Leu Leu Gin His Ala Ser Ala Pro Val Arg 
115 ' 120 . 125 

Thr Gly Ser Glu Gly Gin Val Val Gly Cys Pro Arg, Ala Arg Gly Glu 
130 135 140 

Thr Gly Glu Gly Leu Ser Leu Ala Phe Leu Ser Ser Leu Met Phe Thr 
14 5 150 .155 160 

Ser, Arg Asn Gly Leu Val Gly Cys 

165 • 



<210> 206 ■ 

<211> 218 

<212> PRT 

<213> Homo sapiens 

<400> 206 ' 
Met Gly Ser Ala Ala Leu Glu lie Leu Gly Leu Val' Leu Cys Leu Val 
1 -'5 10 15 

Gly Trp Gly Gly ^eu lie Leu Ala Cys Gly Leu Pro Met Trp Gin Val 
20 25 30 

Thr Ala Phe Leu Asp His Asn He Val Thr Ala Gin Thr Thr Trp Ly3 
3 5 40 45 

Gly Leu Trp Met Ser Cys Val. Val Gin Ser Thr Gly His Met Gin Cys 
* 50 55 60- , 

Lys Val Tyr Asp Ser Val Leu Ala Leu Ser Thr Glu Val Gin Ala Ala 

"•€5 ■ 70 . 75 .' 80 

Arg Ala Leu Thr Val Ser Ala Val leu Leu Ala Phe Val Ala Leu Phe 
85 90 95 

Val Thr Leu Ala Gly Ala Glh Cys Thr Thr Cys Val Ala Pro Gly Pro 
100 , 105 110 

Ala Lys Ala Arg Val Ala Leu Thr Gly Gly Val Leu Tyr Leu Phe Cvs 
115 120 125 

'Gly Leu r Leu Ala Leu Val Pro Leu Cys Trp Phe Ala Asn Tie Val Val 
13.0 135 140 

Arg Glu Phe Tyr. Asp Pro Ser val Pro Val Ser Gin Lys Tyr Glu Leu 
145 150 155 ' 160 

Gly. Ala Ala Leu Tyr Tie Gly Trp Ala Ala Thr Ala Leu Leu Met Val 
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165 170 175 

Gly Gly Cys Leu Leu Cys Cys Gly Ala Trp Val Cys Thr Gly, Arg Pro 
180 185 190 

Asp Leu Ser Phe Pro Val Lys Tyr Ser Ala . Pro Arg Arg Pro Thr Ala 
.195 200 205 

Thr Gly Asp Tyr Asp Lys Lys Asn Tyr Val ' 
* 210 215 



<210> 207 

<211> 73 '■ ' , 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (73). * 

<223> Xaa equals scop translation 

<400>~ 207 . ' ' 

Met Thr Ser Tyr lie Leu lie Ser Phe Val Leu Leu lie Gly Val' Gly 
1 5 ; 10 15 

'Cys He Glu Lys Asp Glh Ser Cys Pro Val Phe <;ly Gly Arg Lys Arg 
20 '. 25 30 . 

Leu His Leu Leu Phe Val Gly Gly Gin Leu Arg Gin Val Arg Met Leu 
.35 40 45 

Arg Gly Glu . Leu Ser Cys Ala Cys Tyr Arg Pro His Val Cln Ala Leu 
50 55 60 

Gin Leu Gly Gly Cys Thr Cys Phe Xaa 
65 70 



<210> 208 ,' 

<211> 348 

<212> PRT 

<213> Hoino sapiens '* * 

<400> 208 j 
Met Leu Cys Pro Trp Arg Thr Ala Asn Leu Gly Leu Leu Leu He Leu 
1 5 10 15 

Thr lie Phe Leu Val Aid Glu Ala Glu Gly Ala Ala Gin Pro Asn Asn 

20 25 30 , 

Ser Leu Mer. Lau Gin -Thr Ser Lys Glu Asn His Ala Leu Aia Ser Ser 

3 5 ' '40 45 

Ser Leu Cys Met Asp.Glu Lys Gin He Thr Gin Asn Tyr Ser Lys' 'Val 
50 ■ - 55 ' 60 

Leu Ala Glu Val Asn Thr Ser, Trp Pro Val Lys Met Ala Thr Asn Ala 

65 70 . ■'■ 75 00 ■ 
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Val Leu Cys Cys Pro Pro lie Ala Leu Arg Asn Leu lie lie lie Thr 

85 \ * ; 90 "■ 95. 

Trp Glu lie lie Leu Arg Gly Gin Pro Ser Cys Thr Lys Ala Tyr Lys 
100 105 110 

Lys Glu Thr Asn Glu Thr Lys Glu Thr Asn Cys Thr Asp Glu Arg rie 
115 1 • 120 125 

Thr Trp Val- Ser Arg Pro Asp Gin Asn Ser Asp Leu Gin He Arg Thr 
130 135 . 140 

Val Ala He Thr Ills Asp Gly Tyr Tyr Arg Cys He Met Val Thr Pro . 
145 , • 150 . 155 160. 

Asp Gly Asn Phe His Arg Gly Tyr His Leu Gin. Val Leu Val Thr .Pro' 
165 170 175 

Glu Val Thr.' Leu Phe Gin Asn Arg Asn Arg Thr Ala Val Cys Lys Ala ' 
180 t 185 190 

• Val Ala Gly Lys Pro Ala Ala His lie Ser Trp lie Pro Glu Gly Asp 

195' . 200 . 205 

Cys Ala Thr Lys. Gin Glu Tyr Trp Ser Asn Gly Thr Val Thr Val Lys 
210., 215 220 

Ser Thr Cys His Trp Glu Val- His Asn Val Ser Thr Val Asn Cys His 
225 • 230 . 235 240 

Val Ser His , Leu Thr Gly Asn Lys. Ser Leu Tyr He Glu Leu Leu Pro 
1 245* 250 255 

Val Pro Gly Ala Lys Lys Ser Ala Lys Leu Tyr lie Pro Tyr He He 
.260 255 270 * 

* Leu Thr lie He He Lau Thr lie Val Gly Phe He Trp Leu Leu Lys ' 

275 ' '280 '285 

Val Asn Gly Cys Arg Lys Tyr Lys Leu Asn Lys Thr Glu Ser Thr Pro 
290 295 , ' 300 

Val Val Glu Glii Asp Glu Met Gin Pro Tyr Ala Ser Tyr Thr Glu Lys 
305 310 315 320 

Asn Asn Pro Leu Tyr Asp Thr Thr Asn Lys Val Lys Ala Ser Glu Ala 
,' . . 325 '330 335 

Leu Gin Ser Glu Val Asp Thr Asp Leu His Thr Leu 
« 340 345 



<210> 209 

<211> 73 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> SITE 

<222> (73) ' 

<22 3> Xaa, equals stop translation 

<400> 209 

Met Ala Arg -Gly Cys Val Cys Ser Leu Cys Ala Ser Val Cys lie Phe 
1 5 10 15 

Leu Ser Ser Leu Phe Pro Leu Leu Pro Ser Val His Ser Val Asn lie 
20 25 30 

lie Ser Cys Leu Leu , Leu Ser Lys Cys Phe Glu Gly Leu Glu Leu Met 
35 40'. 45 

Cys Glu His Leu Tyr Gin Leu Ser GIji Leu His Val Leu His His lie 
50 ' 55 • 60 

Phe Ser Tyr Leu Leu Cys Thr Pro Xaa 



55, 


.70 


<210> 


210 


<211> 


608 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SITE 


<222> 


(265) 


<223> 


Xaa equals any of 


<220> 




<221> 


SITE 


<222> 


(597)' 


<223> 


Xaa equals any. of 


<400> 


210 


Met Val Gly Thr Lys Leu 


1 


5 



10 15 

Leu His Asp Leu Arg Pro Gin Asp Arg Phe Ser lie lie Gly Phe Ser 
20 25 30 

Asn Arg lie Lys Val Trp Lys Asp His Leu lie Ser Val Thr Pro Asp 
.' 35 40 45 

Ser lie Arg Asp Gly Lys Val Tyr He His His Met Ser Pro Thr Gly 
50 55 60 

Gly Thr Asp He Asn Gly Val Leu Gin Arg Ala He Arg Lea Leu Asn 
65 70 .75 60 

Lys Tyr Val Ala His Ser Gly lie Gly Asp Arg Ser Val Ser Leu lie 
85 . 90 95 

Val Phe Leu Thr Asp Gly Lys Pro Thr Val Gly Glu Thr His Thr Leu 
* 100 t 105 110 

Lys lie Leu Asn Asu Thr Acq Glu Ala Ala Arg Gly Gin Val Cys lie 
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115 . ' 120 125 

Phe Thr lie Gly lie Gly Asn Asp Val Asp Phe Arg Leu Leu Glu. Lys 
130 135 140 . 

Leu ser Leu Glu Asn Cys Gly Leu Thr Arg Arg Val His Glu Glu Glu 
"5. 150 155 • / . 160 

Asp Ala Gly Ser Gin Leu lie t Gly Phe Tyr Asp Glu lie Arg Thr Pro 
165 ' 170 175 

Leu Leu Ser Asp lie Arg lie Asp 'Tyr Pro Pro Ser Ser Val Val Gin 
180' 1B5 . 190 

Ala Thr Lys Thr Leu Phe Pro Asn, Tyr Phe Asn Gly Ser Glu lie lie 
195 200 205 

lie Ala Gly Lys Leu Val Asp Arg Lys Leu Asp His ;Leu His Val Glu 
210 21S 220 

Val Thr Ala Ser Asn Ser Lys Lys Phe He lie Leu Lys Thr Asp Val 
■225 230 235 , . 240 

Pro Val Arg Pro Gl.i Lys Ala Gly Lys Asp Val Thr Gly Ser Pro Arg 
245 250 255 

Pro Gly Gly Asp Gly Glu Gly Asp Xaa Asn His He Glu Arg T,eu Trp 
260 265 ' - 270 

Ser Tyr Leu Thr Thr Lys Glu Leu Leu Ser Ser Trp Leu Gin Ser Asp 
275 280 285 

Asp Glu Pro Glu Lys Glu Arg Leu Arg Gin' Arg Ala Gin Ala Leu Ala 
290 295 , 300 

Val Ser Tyr Arg Phe Leu Thr Pro Phe Thr Ser Met Lys Leu Arg Gly 
305 , 310 315 320 

Pro Val Pro Arg Met Asp Gly Leu Glu Glu Ala His Gly Met Ser Ala 
325 330 335 

Ala Met Gly- Pro Glu Pro Val Val Gin Ser Val Arg Gly Ala Gly Thr' 
340 345 350 

Gin Pro Gly Pro Leu Leu Lys Lys Pro Tyr Gin, Pro Arg He Lys Tie 
355 360 365 

Ser Lys Thr Ser Val Asp- Gly Asp Pro Hie Phe Val Val Asp Phe Pro 

370 375 380 \ \. 

Leu Ser Arg Leu Thr Val Cys Phe Asn He Asp Gly Gin Pro Gly Asp 
385' 390 395 400 

He Leu Arg Leu Val Ser Asp His Arg Asp Ser Gly Val Thr Val Asn 
4C5 410 ' 415 

Gly Glu Leu He Gly Ala Pro Ala Pro Pro Asn Gly His Lys Lys Gin 
420 425 430 
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Arg Thr Tyr Leu Arg Thr He Thr He Leu He Asn Lys Pro Glu Arg 

435' ' 44C 445 

Ser Tyr Leu Glu lie Thr Pro Ser Arg Val He Leu Asp Cly Gly Asp 

' ' 450 455 ' 460 

Arg Leu Val* Leu Pro Cys Asn Gin Ser Val Val Val Gly Ser Trp Gly 

465 / 470 475 480 

Leu Glu Val Ser Val Ser Ala Asn Ala Asn Val Thr val Thr lie Gin 

48b ■ 490 . .,495 

Gly' Ser He Ala Phe Val He Leu Her His Leu Tyr, Lys Lys Pro Ala 

' . 500 . * ' 505 .' ,510 

Pro Phe Gin Arg His His Leu Gly Phe Tyr lie Ala . Asn Ser Glu Gly 

1 •. 515 520 . 525 

.Leu Ser Ser Asn Cys His 'Gly Leu Leu Gly Gin Phe Leu Asn Gin Asp 

• 530 535.. 540 

Ala Arg Leu Thr Glu Asp Pro Ala Gly Pro Ser Gin Asn Leu Thx His 

545 550 555 560 

Pro Leu Leu Leu Gin Val Gly Glu Gly Pro Glu* Ala Val 'Leu 'Thr Val 

565 570 . ' ( 575 

Lys Gly His Gin Val Pro Val Val Trp Lys Gin Arg Lys He Tyr Asn 

580 585 590 

Gly Glu Glu Gin Xaa Asp Cys Trp' Phe Ala Arg Asn Met Pro Pro Asn 

595 600 605 . 



<210> 211 .. 

<211> 252 . 
<212> PRT . ' ^ 

<213> Homo sapiens 

<220> ■ / ' > 

<221> SITE ' ■ . ' 

<222> (252) 

<223> Xaa equals stop translation, 
<400> 211 

Met Ala Pro Ala Ser Arg Leu Leu Ala leu Trp Ala Leu Ala Ala' Val 5 

1 5 10 '15 

• . • /. 

• Ala Leu Pro Gly Ser Gly Ala Glu Gly Asp Gly Gly Trp Arc Pro Gly 
20 25 ' 30 

Gly' Pro Gly Ala Val Ala Glu- Glu Glu Arg Cys Thr Val Glu Arg Arg 
.35 40 45 > 



Ala Asp Leu Thr Tyr Ala Glu Phe Vol Gin Gin Tyr Ala , Phe' Val Arg 
50 55 60 
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Pro VaI'il G Leu Gin Gly Leu Thr Asp Asn Ser Arg Phe Arg Ala Leu 
65 . 70 75 80 

Cy3 Ser Arg Asp Arg Leu Leu Ala Ser Phe Gly Asp Arg Val Val Arg 
35 90 * 95 

Leu Ser Thr Ala Asn Thr Tyr Ser Tyr His Lys Val Asp Leu Pro Phe 
100 , 105 110 

Gin Glu Tyr val Glu -Gin Leu Leu His Pro Gin Asp Pro Thr Ser Leu 

115 ,. ! ; .. 120 125 

Gly Asn Asp Thr Leu Tyr Phe Phe Gly Asp Asn Asn Phe Thr Glu Trp 
, 130 135 '140 

Ala Ser Leu Phe Arg His Tyr Ser Pro Pro Pro Phe Gly Leu, Leu Gly 
145 150 , 155 . 160 

Thr Ala Pro Ala Tyr Ser Phe Gly He Ala Gly Ala Gly Ser Gly Val 
'165 170 175 

Pro' Phe His Trp His Gly Pro Gly Pyr Ser Glu Val lie Tyr Gly Arg 
180 185 190 

Lys Arg Trp Phe Leu Tyr Pro Pro Glu Lys Thr Pro Giu Phe His Pro 
195 " ;. 200 205' ■, 

Asn Lys Thr Thr Leu Ala Trp Leu Arg: Asp Thr Tyr Pro Ala Cys Thr 
210 215 220 

Val Cys Thr Ala -Leu Glu Cys Thr . He Arg Ala Gly Glu Val Leu Thr 
225 230 - 235 240 

Ser Arg . Pro Leu Val Ala Cys Tyr Ala Gin Pro Xaa 

245 250 ' 



<210> 212 

<211> 226 . - , . 

<212> PRT 

<213> Homo sapiens * 
<220> 

<221> SITE 
<222> (226) 

<223> Xaa equals, stop translation 



<400> 212 

Met Lys Glu He Pro Ala Leu Leu 

1 ' ■ 5 

Ala Leu Ala He Leu Ser PheCys 
20 ' 

Val Leu Arg His lie Gly Gly Pro 
35 40 

Arg Pro Arg Asp Aig Arg Arg Gin 



His Leu Pro Val Leu He He Met 

10 . 15 

■Tyr Gly Ala Gly Lys Ser Val His 

25 .; SO 

Glu Arg Glu Pro Pro Glr. Ala Leu 
45 

Glu Glu He Asp Tyr Arg Pro Asp 



WO 00/06698 PCT/US99/17130 

'*. 154 



50 55 60 

Gly Gly Ala Gly Asp Ala Asp Phe His Tyr Arg Gly Gin Met Gly ^>ro 
65 . ' 70 ■ 75 * ao 

Thr G-u Gin Gly Pro Tyr Ala Lys Thr Tyr Glu Gly Arg Arg Glu lie 
- ■ ■ 85 90 95 

Leu Arg Glu Arg Asp Val Asp Leu Arg Phe Gin Thr Gly Asn Lvs Ser 
100 . 105 110 

Pro Glu Val Leu Arg' Ala Phe Asp Val Pro Asp Ala Glu Ala Arg Glu 
115 . 120 125 

His Pro Thr Val Val Pro Ser His Lys Ser Pro Val Leu Asp Thr Lys 
130 ..• 1-35 140 

.'Pro Lys Glu Thr Gly Gly He Leu Gly Glu Gly Thr Pro .Lys Glu Ser 
ttS 150 155 . 160 

Ser Thr Glu Ser Ser Gin Ser Ala Lys Pro Val Ser Gly Gin Asp Thr 
.165 170 * ns 

Ser Gly Asn Thr Glu Gly Ser Pro Ala Ala Glu Lys Ala- Gin Leu Lys 
• 180 ;. ' 185 190 

Ser Glu Ala Ala Gly' Ser Pro Asp Gin Gly Ser Thr Tyr Ser Pro Ala 
195 200 205 

.Arg Gly Val Ala Gly Pro Arg, Gly Gin Asp Pro Val Ser Ser Pro Cys 
210 215 . • 220 

Gly Xaa 

225 ' • 



<2XQ> 213 . ■ ' '' 

<211> Si 
<212> PRT 

<213> Homo sapiens 

<220> : 
<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 213 . 

Met Met Gly Leu Leu Glu Thr Gly Asn Val Leu . Phe Trp Val Trp Val 
.1 ■ S 10 15 

Val Val Thr Cys Val Tyr Ser Leu Tyr Ala Asn Ser ' Leu Asn Cys Thr 
20 25 30 

Asp Met Asp Cys Ala Pro Phe Tyr Ket Cys Val Met Leu Gin Gin Lys 
35 40 45 

Cys Gin Xaa , . 

50 



PCT/US99/17130 
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■ <210> 214s, v ' V/ 

<211> 172 
<212> PRT . 
<213> Homo sapiens 

<220> * . . . : v 

<221> SITE 

<222> (172) . , • 

<223> Xaa equals stop translation ,' . 

<400> 214 '. 
Met Trp Leu Trp Ala Val Ser Pro Val Arg Pro Arg Thr Cys Leu Pro 
4 5 .'10 15 

Pro Cys Pro Arg Leu Trp Leu Trp lie Ser Met Thr Leu Val, Pro Ser 
: ' 20 25 30 

Ser Ser- Ala Trp Lys Ser His Gly Ala Pro Ser Thr Arg Met Thr Ser 
35 40. 45 

Pro Gin Leu Leu Leu Leu Ser Thr Arg Pro Pro Gin Ser Pro Ser Ala 

. • • 50 . 55 .. •' . 60- , 

Ser Pro Pro lie Ala Arg Ala His Arg Thr His Pro His Phe Gly Asn 
65 70 • 75 80 

Arg Leu. Ser Tie Thr Cys; Cys Asp Gly . Arg Arg Ser Trp Arg Met Gly 
85 90 95 

Gin His Gly Pro Cys His Leu Asn Leu Gin Thr' Thr His Pro Ala His 
100 ; 105 • :* .110 

Ser Ser Gin Ala Leu Pro Ala* Thr, His Gin Pro Leu Gly Pro Trp Cys 
•115 120 125 

Ser Ser Pro. Ser Pro Phe Pro Ser Lys Leu Pro Ser Ala Gly Lou Arg 
.13 0 13 5 140 

Pro Pro Ala Leu Gly Pro Trp Met Arg Arg Gly ' Pro Trp Pro Gin Ser 
145 ISO ,155 160. 

Trp Gin Met Gly Met His Pro Thr Val Gly Leu Xaa 
' 165 •' . .170 



<210> 215' 

<2ii> 4R . , . ; 

<212> PRT 

<213> Homo sapiens " * . - 

<220> , .. . . ' ' •' 

<221> SITE . 

<222> (48) , 

<223> Xaa equals stop translation 

<400> 215 - 1 

Met Trp Leu Leu He lie Phe Cys Lys Ser Ala Ser Ala Ser Val Leu 
1 5 10 15 
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Cys Trp lie Lys Lys Phe His Pro Val Phe Gin Glu Ser Leu Leu Tyr 

20 25 .30 

Leu Val Gin Glu Gly Ser Leu Cys Tyr Val Gin Gin Lys Val Pro -Xaa 

35 \ ! 40 45 



<210> 215 . 
<211> 139 • ' ' 
<212> PRT • ' 
<213> Homo sapiens 

<400> 216 

Met Glu Ala Val Val Phe Val Phe Ser Leu Leu Asp Cys Cys Ala Leu 
1 5 10 15 

lie Phe Leu Ser Val Tyr Phe He He Thr Leu Ser Asp Leu Glu Cys 
2C 25 30 

Asp Tyr lie Asn Ala Arg Ser Cys Cys Ser Lys Leu Asn Lys Trp Val 
35 40 45 

lie,, Pro Glu Leu He Gly His Thr He Val Thr Val Leu Leu r..«u Met 
■ 50 55 60 

Ser Leu His Trp Phe He Phe Leu Leu Asn Leu Pro val Ala Thr Trp 
65 70 75 80 

Asn lie Tyr Arg Tyr He Met Val Pro Ser Gly Asn Ket Gly Val Phe 
.. 85 90 95 

Asp Pro Thr Glu He His Asn Arg Gly Gin Leu Lys Ser His Met Lys 
'100 105 110 

Glu Ala Met He Ly3 Leu Gly Phe His Leu Leu Cys Phe Phe Met Tyr 
115 120 .125 

Leu Tyr Ser Met He Leu Ala Leu lie Asn Asp 

130 135 • • 



<210> 217 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> ' \ 

<221> SITS . 

<222> (41) ^ , 
<223> Xaa equals stop translation 

<400> 217- ' 

Met Ser Gly Ser Ser Leu . Pro Ser Ala Leu Ala Leu Ser Leu Leu Leu 
1 f 5 10 * 15 
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Val Ser Gly ser Leu Leu fro Gly Pro Gly Ala Ala Gin Asn Val Arg 
20 25 30 

Val Gin Ser Gly Gin Asp Gin Lya Xaa 

. 35 40 ' . 

<210> 218 * , 

<211> 52 

<=212> prt . ' 

<213-> Homo sapiens 

<220> 

<221> SITE 
<222> (52) 

<223> Xaa equals stop translation 
<400> 218 ' 

Met Pro Ser Hia lie Arg Ala His Leu Phe Leu Leu Leu Phe Phe Leu 

1 ■ 5 ' 10 •. , ' 15 . • • . 

Phe He Tyr Gin Gly He Ser Ser lie Ser Gin Ala Ser Gly Leu Thr 

20 25 30 . 

Leu Lys Thr Gin Asn clu Lys Asp He Gin Val Ser He Leu Lys Glu 
35 40 45 

Phe Val Val Xaa f , 

50 , 

<210> 219 

<211> 49 

<212> PRT ' 

<213> Homo sapiens 

<220> 

<221> SITE ' , .'■ • 

<222> (49) 

<223> Xaa equals stop translation 

<400> 219 / ' ' * ■ 1 

Met Cys He Tyr Gin Ser Glu Gin Met Leu Ala "lou Leu Leu Val Leu 
1 5 10 is 

val Phe Cys He Ser Leu Leu Val Leu Val Cys Trp Gly Ser His Asn 
20 25 30, 

Lys Val Pro Gin Lys Phe lie Phe Ser Gin Phe Trp Gly Leu Glu Asp 
35 ,.40 45 ' 

Xaa ' 



<210> 220 

<211> 42 

<212> PRT 

<213> Homo sapiens 
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<220> • . "". 

<22l> SITE ' ' ■ ' ' ■ ' 1 

<222> (42) ' • 

<221> Xaa equals s-iop translation ' . , 

<400> 220 ■ ' 

■ Met Ala Val Pro Leu. Phft LeuTyr lis Phe Thr Leu Leu Pro Leu Leu 

1 . ^ 5 10 . 15 

Pro Phe Leu Leu Ser Leu Cys Phe Ser Pro Leu Thr Val Lys Arg Ser 
, 20 25 ' 30 

Ser Ser Ser Glu Ser Lys Ser Ser Leu Xaa - * 

.. .. ; 35 . • 40 ( ■ ' • 1 

<210> 221 . . • . 

<211> 41 

' <212> PRT , . ' , 

<213> Homo sapiens 

<220> " .• . 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 221 ~ • * ■ 

Met Gly Met Leu Leu Ala Phe Trp Leu Pro Gly Ala Ser Trp Gin Glu 
'• 1 5 10 is 

Ala Gly Pro Arg Ala Ser Thr Gin Arg Met Arg Thr Gin Thr Gin Met 

' • 20 25 . 30 '• . : 

Ser Thr Arg Lys. Pro Lys Pro Ala Xaa ' t .. 

. 35 ' 4Q ; ' j * ' 

<210> 222 1 . • 

<211> 43 • , 

<212> PRT 

. <213> Homo sapiens , 1 

<220> ' , 

<221> SITE 

<222> (43) ' 

<223> Xaa equals stop translation <■.*•■ \ 
<400> 222 , , 

Met Glu Pro Ala Ke- Val Leu Lys Phe Leu Ser' Ser Leu Pro Glu. Ash 
, 5 " ■.' 10 . .15 • 

Leu Phe Leu Pro Ser; Leu Leu Phe Phe Ala Trp Leu Cys" Trp Asn Met 
20 25 . 30 . . 

Val Cys Gly Ser Pro Val Ser Cvs Pro Tyr Xaa ' 

35 40 - ' : • " : 
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<210> 223. ' 
<2I1> 204 ■ * 
<212> PRT 
<213> Homo sapiens 

<220> . ' _ 

<22I> SITE ' ' , ' 

<222> (204) . ' *" 

<223> Xaa equals scop translation' 

<400> .223 

Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Trp Gly Cys • 
1 ; 5 ■ 10 '15 

Phe Ser Gin Leu Met Leu Tyr Aid Glu Arg Ala Glu Ala Arg Arg Lys 
20 25 30 

Pro Asp lie Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala .Val ! . 
35 40 "45 

Leu Cys Ala ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
50 ^ 55 60 

Leu Gly Ala Ala Leu Gin Leu Ala lie Ser Thr Tyr Ala Ala Tyr lie 
65 '70 75 ' 80 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyf ser 
85 90 95 

Arg Thr Val Ala lie lie Gly Gly Phe Leu Val Leu Ala Ser Gly Ala 
100 ' 105 , 110 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg' Ser Arg Ser Leu Gin Ser Thr 
115 120 125 * 

Gly Gin Val Phe Leu Gly He Tyr Leu lie Cys Val Ala Tyr Ser Leu 
130 135 ■ 140 . 

Gin His Ser Lys Glu' Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
, 14S 150 ■ 155 160 

'Gly Glu Leu Met lie Gin Leu Phe Phe Val Leu Tyr Gly lie Leu Ala 
165 170 175 

Pro Gly Leu Ser Val Arg Leu Leu Arg Asp Pro Arg Cys Pro Asp Pro 
■ 180 185 190 

Gly Cys Thr Ala Ala Pro Cys His Ala Ala His Xaa 
195 . 200 * 

<210> 224 .. 
<211> 43 
<212> PRT 

<213> Homo sapiens r 
<220> 

<221> SITE 
<222> <43 ) 
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. <223> -Xd.ci equals scop translation 
<400> 224 

Met Arg Val Arg lie Gly Leu Thr Leu Leu Leu Cys Ala Val Leu Leu 

•1 • •' ' 5' • 1 10' 15 

r Ser Leu Ala Ser Ala -Ser Ser Asp Glu Glu Gly Ser Gin Asp Glu Ser 
20 25 30 

Leu Gly Phe Gin Asp Tyr Phe Asp lie Arg Xaa 
35 . 40 



. <210> 225 
<211> 156 . ■ 

<212> PRT 
<213> Homo sapiens 

<220> 
<221> SITE 
<222> (156) 

<223> Xaa equals stop translation . * 
<400> 225 . ^ 

Met Ala Arg Gly Ser Leu. Arg Arg Leu Leu Arg Leu Leu Val Leu Gly 
15 10 15 

Leu Trp Leu Ala Leu Leu Arg Ser Val Ala Gly Glu Gin Ala Pro Gly 
20 .25 30 

Thr Ala Pro Cys Ser Arg Gly Ser Ser Trp Ser Ala Asp Leu Asp Lys 
35 40 45 

l Cys Met Asp Cys Ser Thr Ser Cys Pro Leu Pro Ala Ala Leu Ala His 
50 55 60 

Pro Trp Gly . Arg Ser Glu Pro Asp Leu Arg, Ala Gly Ala Ala Phe Trp, 
65 ' , 70 75 80 

Leu Phe Gly Leu Glu Thr Met Pro Gin Glu Arg Glu Val His His Pro, 
85 90 95 

His Arg Gly Asp Arg Arg Arg Gly Leu Pro Ser Cys Gly Ala Asp Pro 
100 105 ' 110 

val Thr Met Cys Pro Leu Pro Ala Gly Ala Arg Pro Leu lie He His 
■115 120 125 ■ 

Ser Ser lis Leu Glu Pro Val Ser Ala Ser Gin Thr Arg Arg Glu Pro , 
130 135 140 

( Ser,Ser Ser Asn Kis Lys Gly Gly G:y Gly Arg Xaa 
* 145 150 155 



<210> 226 

<211> 74 ; 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 

<222> (38) , - 

<223> Xaa equals any of the naturally occurring L-anino acids 

<220> ' ' 

<221> SITE 

<222> (48) ' * ( * 

<223> Xaa, equals any of the- naturally occurring L-amino acids 

<220> ... . ' 

<22i> SITE ' " ' . 

<222> (54)' ' ' ' 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> ' ' •• •. ' ••' ' ,•• * 

<221> SITE 

<222> (55) ' 

<223> Xaa equals any of the naturally occurring L-amino acids f 

<220> . ' 

<221> SITE 

<222> (68) .; " _ •. 

<223> xaa equals, any of the naturally occurring L-amino acids 

<220> . ' 
<221> SITE • 
<222> (74) 

<223> Xaa equals stop translation ; 

<400> 226 • ■ ■ . ' v * > : 

Met Phe Tyr Lys Leu Thr Leu lie Leu Cys Slu Leu Ser Val Ala Gly 
1 5 10 15 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin Arg Leu Pro' Arg His • 
" 20 25 30 

He Cys Ser Gin Arg Xaa Pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa 
35 40 45 

Pro Asp Lys Profile Pro Arg Leu Ser 

. • • 60 

Lys Arg Xaa. " . * 



<210> 227 

<211> 167 , - 

<212> PRT / -' 

<213> Homo sapiens • 

<220> 

<221> SITE 
<222> (167) 

<223> Xaa equals stop translation 



Leu Gin Thr Thr Trp Xaa Xaa 

50 55 

Pro Pro Leu Xaa Ser Asp Pro 
65 70 



<A00> 221 
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Met Gly Ser Arg Phe Leu Leu Val Leu Leu Ser Gly Leu Thr Val Leu 

1 ■ - .5 io 15 . 

Leu Ala Leu Pro Gly Ser Glu Ala Lys Asn Ser Gly Ala Ser Cvs Pro 
20 25 ' 30 

Pro Cys Pro- Lys Tyr Ala Ser Cys His Asn Ser Thr His Cys Thr Cys 
35 40 45 

Glu Asp Gly Phe Arg Ala Arg Ser Gly Arg Thr Tyr. Phe His Asp Ser 
50 55 SO 

Ser Glu Lys Cys Glu Asp Ilo Asn Glu Cys Glu Thr Gly Leu Ala Lys 
65 • 70" 75 80 

Cys Lys. Tyr Lys Ala Tyr Cys Arg Asn Lys Val Gly Gly Tyr rle Cys 
85 90 95 

Ser Cys Leu Val Lys Tyr Thr Leu Phe Asn Phe Leu Ala Gly lie lie 
100 105 no 

Asp Tyr Asp His Pro Asp Cys Tyr Glu Asn Asn Ser Gin Gly Thr Thr 
115 120 125 

. Gin Ser Asn Val . Asp lie Trp Val. Ser Gly Val Lys Pro Gly Phe Gly 
130 135 140 

Lys Gin Leu Val Arg lie Thr Met Pro Phe Ser tyr' Pro Asn lie Asn 
145 150 ; 155 160 

Met Ser Ser Cys . Asp Phe JCaa 
165 



<21Q> 228 . ' 

<2U> 71 

<212> PRT ' 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (71) 

■ <223> Xaa' equals stop translation 
<400> 228 

Met Lys Pro Lys His Leu Glu Trp Cys Leu Ala His' Ser Trp Cys -Val 
1 '5 ' 10 is ' 

lie Trp Leu Ser Phe Val Ser Pro Pro Thr Ser His Leu Glu Cys Asp 
20 25 • 30 

Gly Phe Pro Gly Ser Leu Leu Pro Pro Cys Clu Glu Gly Arg Cys Phe 
35 40 ■ 45 

Pro Phe Thr Phe His His His Asp Cys His Gly Cys Ser Pro Leu Gin 

■50 1 - - 55 ' 60 *' 



Ser, Ser. Pro Gly Gin His Xaa 
70 
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<210> 229 , ■ .♦ ■ : 

<211>-'273 t _- ' - 
<212> PRT t _ 
; <213> Homo sapiens , 

<220> " - , ' ' 

<221> SITE • 

<222> (273) . •• 

<223> Xaa equals stop translation 

- <400> 229 • , • 

Met CysCys Trp Pro Leu Leu Leu Leu Trp Gly Leu Leu Pro Gly Thr 
i: 5 • 10 15 

Ala Ala Gly Gly Ser Gly Arg Thr Tyr Pro His Arg Thr Leu Leu Asp 
20 25 - 30 

Ser Glu'Gly Lys Tyr Trp Leu Gly Trp Ser Gin Arg Gly Ser Gin lie 
35 4 0 45 . 

Ala Phe Arg Leu Gin Val Arg Thr Ala Gly Tyr Val Gly Phe Gly Fhe 
50 55 V 60 

Ser Pro Thr Gly Ala Met Ala Ser Ala Asp lie Val Val Gly Gly Val 
65 70 • 75 80 

Ala His Gly Arg Pro Tyr Leu Gin Asp Tyr Phe Thr Asn Ala Asn Arg 
85 . 90 ' , 95 

Glu Leu 'Lys Lys Asp Ala Gin Gin Asp Tyr His Leu Glu Tyr Ala Mat 
•100- . ' 105. ' ; ' • 110 . 

Glu Asn Ser Thr His Thr lis He Glu Phe Thr ' Arg Glu Leu His Thr 
11^ 120 ' 125 V ., 

, Cys Asp lie Asn Asp Lys Ser lie Thr Asp Ser Thr Val Arg Val He v 
130 135 140 

Trp Ala Tyr His His Glu Asp Ala Gly Glu Ala Gly Pro Lys Tyr His 
145 ■ 150 155 ' 160 

Asp Scr Asn Arg Gly Thr Lys Ser Leu Arg Leu Leu Asn Pro Glu Lys 
165 170 175 

Thr Scr Val Leu Ser Thr Ala Leu Pro Tyr Phe Asp Leu Val Asn Gin 
- 180 185 190 

Asp Val Pro He Pro Asn Lys Asp Thr Thr Tyr Trp Cys Gin Met Phe 
195 '200 205 

Lys He Pro Val Phe Gin Glu' Lys His His Val He Lvs Val Glu Pro 
210 215 220 

Val He Gin Arg Gly His Glu Ser Leu Val His His* He Leu Leu. Tyr 
225 230 ,235 ' 240 

Gin Cys Ser Asn Asn Phe Asn Asp Ser Val Pro Gly He Arg A^la. Arg 
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245 



255 



lie Ala lie Thr Pro Thr Cys Pro Met His Ser Ser Pro Val Lys Leu 
• 260 265 270 



<210> 230 • 
<211> 82 

<212> PRT , 

<213> Homo sapiens ^ 

<220> ' 
<221> SITE 
<222> (82) 

<223> Xaa equals stop translation ■ 
' <400> 230 

Met Arg, Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val Leu Leu 
1 1 ' ' S 10 . L5 

Val Gly Leu Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Gin Pro Val 
20 25 30 

Cys Arg Gly Gly Thr Gin Arg Pro Cys Tyr Lys Val lie Tyr Ph*? His 
35 40 45 ' 

Asp Thr Ser Arg Arg Leu Asn Phe Glu Glu Ala Lys Glu Ala Cys Arg 
50 . •- 55 60„ 

.Arg Gly Trp Arg Pro Ala Ser Gin His Arg Val Leu Lys Met Asn Arg 
65 70 75 80 

Asn Xaa '. . 



. <210> 231 •■ , . 

<211> 71 
<212> PRT 
<213> Homo sapiens 

. <220> * 1 ' 

<221>, SITE' 
<222> ( 3 B ) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 231 

Met Ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr t Ala Val 
1 ' 5 .10 15 

.Gly Ala Ala Val lie Leu Ala Gin Leu Leu Arg Arg Cys Arg Gly Gly 
20 25 30 . 

Phe Leu Glu Pro Val Xaa Pro , Pro Arg Pro Asp Arg Val Ala He Val 
; 35 40 ,45 . 
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Thr Gly Gly Thr Asp Gly He Gly Tyr Ser Thr Ala Asn Tie Trp Arg 
50 55 60 

Asp Leu Ala Cys Met Leu Ser 
65 ' 70 1 



. <=210> 232 ' 
<211> 225 
<212> PRT 

<213> Homo sapiens 

'*• <220> ' • 

<22l> SITE •• 
<222> ib) 

<22 3>* Xaa equals any of the naturally occurring L -amino acids 
<400> 232 

.His Glu Arg Ala xaa Gly fro'ser Arg Gly His Gly Glu Leu Leu Ser 

1 , . 5 ■ 10 15 

■ Cys Val Leu Gly Pro Arg Leu Tyr Lys lie Tyr Arg Glu Arg Asp Ser 
20 25 30 

Glu Arg Ala Pro Ala Ser Val Pro Glu Thr Pro Thr Ala Val Thr Ala 
35 .-■ 40 45 

.Pro His Ser Ser Ser Trp Asp .Thr Tyr Tyr Gl'n Pro Arg Ala Leu Glu - 
50 55 SO 

Lys His Ala Asp Ser lie Leu Ala Leu Ala Ser Val Phe Trp Ser lie 
65 . . 70 75 80 

Ser Tyr Tyr Ser* Ser Pro Phe Ala Phe Phe Tyr Leu Tyr Arg Lvs Gly 
85 90 95 

Tyr Leu- Ser Leu Ser Lys Val Val Pro ?he Ser His Tyr Ala Gly Thr 
100 105 110 

Leu Leu Leu Leu Leu Ala Gly Vol Ala Cys Ser Glu Ala Leu Ala Ala 
115 ' 120 125 

Gly Pro Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser Trp Lys Gin His 
130 135 140 • 

lie Gly Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu Pro Thr Thr Thr 
I 45 ISO . • 155 • 160 

Leu Thr Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly Lys Asn Pro Ala 
• 165 . • 170 175 

Ala Gly Arg Scr Leu Glu Gly Ala Leu Pro Ala Gly Val Trp Pro Cys 
180 185 190 ' 

Phe Ala Gin Ser Pro Cys Thr Gly Gly Gin Gin Thr Pro Ser Ser Thr 

^95 200 . ( . . ,205 ... 

Gly Leu Arg Ser Cys Leu Val Arg Ser Pro Ala Thr' Trp Trp Arg Thr 
210 215 220 
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Pro 

225 



<210> 233 

<211> 314 " . 

<212> PRT ( . 
<213> Homo sapiens 

<220> . • 

<221> SITE ■ - 

<222> (147) ' ■ '. ; 

<223> xaa equals any of the naturally occurring L-amino acids 

<220> ■ ' 

<221> SITE * 

<222> (211) ' 

<:223> xaa equals any of the naturally occurring L-amino acids 

<400> 233 . 

Met Leu Pro Ala Arg Leu Pro Phe Arg Leu Leu Ser Leu Phe Leu Arg 

1 . .5. 10 ,. 15 

Gly Ser Ala Pro Thr Ala 'Ala. .Arg His Gly Leu Arg Glu Pro Leu Leu 
20 25 30 

Glu Arg Arg Cys Ala Ala Ala Ser Ser Phe Gin His Ser Ser Ser Leu 
35 40 45 

Gly Arg Glu Leu Pro Tyr Asp Pro Val Asp Thr Glu Gly Phe Gly Glu, 
50 55 60 ' 

Gly Gly Asp Met Gin Glu Arg Phe Leu Phe Pro Glu Tyr lie Leu Asp 
65 70 75 80 

Pro Glu Pro Cln Pro Thr Arg Glu Lys Gin Leu Gin Glu Leu Gin Gin 
85 90 - 95 

Gin Gin Glu Glu Glu Glu Arg Gin Arg Gin Glr. Arg Arg Glu Glu Arg 
100 105 110 

Arg Gin Gin Asn Leu Arg Ala Arg Ser Arg Glu His Pro Val Val Gly 
115 • 120 125 

His Pro Asp Pro Ala Leu Pro Pro Ser Glv Val Asn Cys Ser Gly Cys 
130 135 * 140 

Gly Ala Xaa Leu i:is Cys Gin Asp Ala Gly Val Pro Gly Tyr Leu' Pro 
145 ISO , • 155 160 

Arg Glu Lys Phe Leu Arg Thr Ala Glu Ala Asp Gly Gly Leu Ala Arg 
165 170 175 

Thr Val Cys Gin Arg Cys Trp Leu Leu Ser His His Arg Arg Ala* Leu 
180 185 190 

Arg Leu Gin /Val Ser Arg Glu Gin Tyr Leu Glu Leu Val Ser Ala Ala 
195 200 205 
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Leu Arg Xaa Pro Gly Pro Bar Leu Vai Leu Tyr Met Val Asp Leu Leu 
210 . 215 . 220 

Asp Leu Pro Asp Ala Leu Leu Pro Asp Leu Pro Ala Leu Val Gly Pro 
22 5 1 230 23 5 ■ '240 ' 

Lys Gin Leu lie Val Leu Gly Asn Lye Val Asp' Leu Leu Pro Gin Asp 

245, * 250 ' * ' " 255 

ALa Pro Gly Tyr Arg" Gin Arg Leu Arg Glu'Arg Leu Trp Glu Asp Cys 
260 265 270 

Ala Arg Ala Gly Leu Leu Leu Ala Pro. Gly Thr Lys' Gly His Ser Ala 
275 ' 280 285 

Pro Ser Arg Thr Ser His Arg Thr Gly Arg lie Arg Tie Arg Arg Thx 
290 2 95 300 

Gly Pro Ala Gin Trp Ser Gly Thr Cys Gly 
305 • ' 310 



<210> :234 : 

<211> 93 ' ; •• • 

<212> PRT , 

<213> Homo sapiens * „ 

<400> 234 , , 

Met Arg Pro Gin Gly Pro Ala Ala Ser Pro Gin Arg Leu Arg Gly Leu 

. 1 • 5 10 15 ■• ' 

i < ' , . 

Leu,' Leu Leu Leu Leu Leu Gin Lou Pro Ala Pro Ser Ser Ala Ser Glu 
20 25 ' 30 

He Pro Lys Gly Lys Gin Lys Ala His Ser' Gly Arg Gly Arg Trp Trp 
35 40 45 

Thr Cys He Met Glu Cys Ala Tyr Lys Gly Gin Gin Glu Cys Leu Val ' 
50 • 55 60 

Glu Thr Gly Ala* Leu Gly Pro Met, Ala Phe Arg Val His Leu Gly Ser 
65 70 . 75 • 80 

Gin Val Gly Met Asp Ser Lys Glu Lys Arg Gly Asn Val 

85 • 90 • . 



<210> 235 t . > 

<211> 73 , ■ . 

<2L2> PRT 

<213> Homo sapiens 

<220> 

<221> SITE ' * 
<222> (73 ) 

<22 3> Xaa equals stop translation 



<400> 235 
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Met Gly Ser Ala Ala Leu Glu lie Leu' Gly Leu Val Leu Cys Leu Val 
1 5 10 is 

;Gly Trp Cly cly Leu lie Leu Ala Cys Gly Leu Pro Met Trp Gin Va 1 
20 25 . " 30 

Thr Ala Phe Leu Asp His Asn lie Val Thr Ala Gin Thr Thr Trp Lys' 
35 40 , 45 

Gly Leu Trp Met Ser cys val, Val Gin Ser Thr Gly Thr Cys Ser Ala 
t>0 . 55 60 

Lys Cys Thr Thr Arg Cys Trp Leu Xaa 
65 70 



<210> 236 

<211> 349 " ' ' - . ' . 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 1 
<222> (283) 

<223> Xaa equals any of the naturally occurring L -amino acids 
<220> 

<221> SITE " 
<222> (293) 

<223> Xaa equals any of the' naturally occurring L-amino acids 
<220> 

<221> SITE ....*' 
<222> (325) 

<223> Xaa equals any of the naturally occurring L-ajnino acids 
<220> 

<221> SITE 
<222> (326) ■ 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> (349) ' 
<223> Xaa equals stop translation 



Asn Leu Gly Leu Leu Leu lie Leu 
. 10 ' 15 

Thr He Phe "Leu val Ala Glu Ala Glu Gly Ala Ala Glh Pro Asn Asn 
20 25 30 

Ser Leu Met Leu Gin Thr Ser Lys Glu Asn His Ala Leu Ala Ser Ser 
35 40 45 



<400> 236 

Met Leu Cys Pro Trp Arg Thr Ala 
1 5 



Ser Leu Cys Net. Asp Glu Lys Gin lie Thr Gin Asn Tyr Ser Lys Va" 
50 55 60 
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Leu Ala GIu Val Asn Thr Ser' Trp Pro Val Lys Met Ala Thr Asn Ala 

65" ■ 70 75 ■' ■ • 80 ' 

Val Leu Cys Cys Pro Pro He Aid Lea Arg Asn Leu lie He lie Thr 
85 90 • 95 

Trp Glu lie lie Leu Arg Gly, Gin Pro Ser Cys Thr Lys Ala Tyr Lys 
100 - ' 105 " iio 

' Lys Glu Thr Asn Glu Thr Lys Glu Thr Asn Cys Thr Asp Glu- Arg He 
: . H5 , 120 125 

Thr Trp Val Ser Arg Pro Asp Gin Asn Ser Asp Leu, Gin lie Arg Thr 
130 135 140 ■ . ' 

Val Ala lie Thr His Asp Gly Tyr Tyr Arg Cys lie Met Val Thr Pro 
145 150, • 155 -60 

Asp Gly Asn Phe His Arg Gly ,Tyr His Leu 'Gin Val Leu Val Thr Pro 
165 170 : 175 

Glu Val Thr Leu Phe Gin Asn Arg Asri Arg Thr Ala Val Cys Lys Ala 
ISO 185 190 

Val Ala Gly Lys Pro Aid Ala His He Ser Trp He Pro Glu Gly Asp 
195 • 200 205 

Cys Ala Thr Lys Cln Glu Tyr Trp Ser Asn Gly Thr Val Thr Val Lys 
210 215 220 

Ser Thr Cys His Trp Glu Val His Asn Val Ser Thr Val Asn Cys His 
225 230 , 235 240 

Val Ser His Leu Thr Gly Asn Lys Ser Leu Tyr lie Glu Leu Leu Pro 
245 250 255 

Val Pro Gly Ala Lys Lys Ser Ser Lys Leu Tyr He Pro Tyr He He 
260 265 270 

Leu Thr He He He Leu Thr He Val Glv Xaa He Trp Leu Leu- Lys 
275 280 285 

Val Asn Gly Cyc Xaa Lys Tyr Lys Leu Asn Lys Pro Glu Ser Thr Pro 
290 295 300 

Val Val Glu Glu Asp Glu Met: Gin Pro Tyr Ala Phe Tyr Thr Glu Lys 
305 ... 310 315 320 

Asn Asn Pro Leu Xaa Xaa Thr Thr Asn > Lys Val Lys Ala Ser Glu Ala 
325 330 335 

Leu Gin Ser Glu Val Asp Thr Asp Leu His Thr Leu Xaa 
340 345 



<210> 237 

<211> 17 

<212> PRT 

<2I3> Homo sapiens 
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-<400> 237 , 

Leu Ala Leu Tyr Ser Ala Leu Phe Ser Tyr Ser Gly Trp Asp Thr Leu 

t 1 ,5 10 15 

Asn 



<210> 238* fc 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 238 

Val Thr Glu Glu lie Lys Asn Pro Glu Arg Asn Leu Pro Leu 
1 5 10* 



<210> 239 * 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 239 

lie Gly lie Ser Met Pro lie Val Thr 
1 .5 



<210> 240 it ■ ■ 

<211> 13 ' • ' 1 . 

<212> PRT ' . 

<213> Homo sapiens ' . ' 

<400> 240 ' 

lie Tyr lie Leu Thr Asn Val Ala Tyr Tyr Thr Val Leu ■ 
1 5 10. 



<210> 241 

<211> 11 

<212> PRT ' 

<213> Homo sapiens 

<400> 241 

Ser Asp Ala Val Ala Val Thr Phe Ala Asp Gin 
1 5 10 



<210> 242 

,<211> 13 • ' 
<212> PRT •*. * 

<213> Homo sapiens , . 

<400> 242 ' , r ■ 

Val Ala Leu Ser Cys Phe Gly, Gly Leu Asn Ala Ser He 
1 5 • 10 
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<210> -243 * " ' 

. <2I1> 15 
<2I2> PRT ' ^ 
<213> Homo- sapiens 

. <400> 243 

Ser Arg Leu ^Phe Phe Val Gly Ser Arg Glu Gly His Leu Pro Asp 

1 '• '5 10 . ■ 15 

; . . 

<210> 244 

■ <2ii> li 

<212> PRT *__.■-, 
<213> Homo sapiens . 

<40C> 244 

Ser Phe Ser Tyr Trp Phe Phe Val Gly Leu Ser 

1 5 10 . . 



<210> 245". 

<211> 11 ' . 

<212> PRT ■ 
<213> Homo sapiens 

<400> 245 

Val Gly Gin Leu Tyr Leu Arg Trp Lys Glu Pro 

.1 5 : ; 10 



<210> 246' ' 
<211> IS 

<212> PRT • 
<213> Homo sapiens 

<400> 246 

Arg Pro Arg Pro Leu Lys 
• 1 5 



leu Ser Val Phe Phe Pro lie val Phe Cys 

10 • * 13 



.<210> 247 

<211> 9 

<212>.PRT 

<213> Homo sapiens 

<400> 247 

Asp Thr He Asn Ser Leu lie Gly lie 

1 . - 5 



<210> 248 

<211> 44 

<212> PRT 

<2l3> Kono sapiens 



<400> 248 
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Ala Thr Ala Leu Pro Pro Lys lie Val Gly Ser Ala Thr Arg Tyr Leu 

1 . 5 10 15 • 

Gin Val Leu Cys Mec Ser Val - Ala Ala Glu Met Asp "Leu Glu Asp Gly 
20 25 30 

Gly Glu Met Pro Lys Gin Arg Asp Pro Lys Ser Asn 

■ 35 ■ ' ',40 •■• ' '■' " 



<210> 249 

<211> 352 , ' , 

• <212> PRT 
<213> Homo sapiens 

<400> 249 . 

Leu Leu Ala Ala Ala Cys, He Cys Leu Leu Thr Phe lie Asn Cys Ala 

. i s io - 15 ; 

Tyr Val Lys Trp Gly Thr Leu Val Gin Asp lie Phe Thr Tyr Ala Lys 
2 0 25 3 0 

Val Leu Ala Leu He Ala Val He Val Ala Gly rle VaL Arg Leu Gly. 
• . 35 40 45 

Gin Gly Ala Ser Thr His Phe Glu Asn Ser Phe Glu Gly Ser Ser Phe 
.50 55 • 60 

Ala Val Gly Asp He Ala Leu Ala Leu Tyr Ser Ala Leu Phe Ser Tyr 
. 65 • - 70 75 80 

Ser Gly Trp Asp Thr Leu Asn Tyr Val Thr Glu Glu lie Lys Asn Pro 
85 90 ' 95 

Glu Arg Asn Leu Pro Leu Ser lie Gly He Ser. Met Pro lie 'Val Thr 
' ' ' 100 _ ' ■ 105 HO 

Tie He Tyr II© Leu Thr Arm* Val Ala Tyr Tyr Thr Val Leu Asp Met 
. 115 120 125 

Arg Asp He Leu Ala Ser Asp Ala Val Ala Val Thr Phe Ala Asp Gin 
130 135 140 ' .. 

lie Phe Gly lie Phe Asn Trp He He Pro Leu Ser Val Ala Leu Ser. 
145 150 155 160 

Cys Phe Gly Gly Leu Asn Ala Ser lie Val Ala Ala Ser Arg Leu Phe 
165 170 ,175 

Phe Val Gly Ser Arg Glu Gly His Leu Pro Aso Ala He Cys Met lie 
180 185 190 

His Val Glu. Arg Phe Thr Pro Val Fro Ser Leu Leu Phe Asn Gly He 
195: 1 \ '200 2C5 

Met Ala Leu lie Tyr Leu ' Cys -Val ' Glu Asp He Phe Gin Leu Ila Asn 
210 - 215 220 

Tyr Tyx Ser Phe Ser Tyr Trp Fhe, Phe Val Gly Leu Ser He Val Gly 
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225 230 235 240 

Gin Leu Tyr Leu Arg Trp Lys Glu Pro 'Asp Arg'Pro Arg Pro Leu Lys 
245 250 255 

Leu Ser Val Phe Phe Pro lie Val Phe Cys Leu Cys Thr lie Phe Leu 
■ ■ ■ 260 . 265 270 

Val Ala Val ' Pro' Leu Tyr Ser Asp Thr He Asn. Ser Leu He Gly He 
275 ' 280 285 

Ala lie Ala Leu S«=r Gly Leu Pro Phe Tyr Phe Leu. lie lie. Arg Val 
290 ... ' 295 300 

Pro Glu His Lys Arg Pro Leu Tyr Leu Arg Axg lie Val Gly Ser Ala 
305 310 315 1 320 

Thr Arg Tyr Leu Gin Val Leu Cys Met Ser Val Ala Ala Glu Met Asp 
325 330 335 

Leu Glu' Asp Gly Gly Glu Met Pro Lys Gin Arg Asp Pro Lys Ser Asn 
340 , 345 350 



<210> 250 

<2ll> 119 ■ ■ . 

<212> PRT . ' ' f 

<213> Homo sapiens 

<400> 250 

Ala Ala Arg Gly Ser Gly Val Arg Asp Pro Leu Glu Glu Ala Val Cys 
1 5 • 10 15 . 

Pro Phe Ser Asp Leu Gin Leu His Ala Gly Arg Thr Thr Ala Leu Phe 

20 25 ' ,30 

• Lys Ala Val Arg Gin Gly His Leu Ser Leu Gin Arq Leu Leu Leu Ser 
' r 35 ■ ( - 40 45 

Phs Val Cys Leu Cys Pro Ala Pro Arg Gly Gly Ala Tyr Arg Gly Arg 
50 ' 55 60 

. Gin Ala Ser Leu Ser Cys.Gly Gly. r.em His Pro Val Arg Ala Ser Arg 
65 < ■ 70 75 60 

Leu Leu k Cys Leu Pro Lys Cln/Ala Trp Ala Met Ala Gly Ala Pro Pro 
8 5 .. 90 9? 

Pro Val Ser Leu Pro Pro Cys Ser Leu He ? Ser Asp Cys Cya Ala Ser 
100 105 HO 

Asn Gin Arg Asp Ser Val Gly 

115 . . : . ' - ' 



<210> 251 
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<211> 356 
<212> PRT 

<213> Homo sapiens . 

<220> . >■ 

<221> SITE 

<222> (37) ' 

<223> Xaa equals any of -the naturally occurring L-amino acids 

<220> 
• <221> SITE 
<222> (280) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<40O> 251 

Leu Ser Lys Ala .Phe Leu Asp Ser Pro Asn Arg Leu Leu Ala Val Glu 
1 - 5 10 15 

Met Asn Thr Asp His Leu Arg Leu Thr Val Pro Asn Gly He Gly Ala 

• 20 . 25 . 30 

Leu Lys Leu Arg Xaa Met Glu Kis Tyr Phe Ser Gin Gly Leu Ser Val 
35 40 45 

Gin Leu Phe Asn Asp Gly Ser Lys Gly Lys Leu Asn His Leu Cys Gly - 
50 ; 55 60 

Ala Asp Phe Val Lys Ser His Gin Lys Pro Pro Gin Gly Met Glu He 
€5 70 75 80 

Lys Ser Asn Glu Arg Cys Cys Ser Pho Asp Gly Asp Ala Asp Arg lie 
85 90 95 

Val Tyr Tyr Tyr His Asp Ala Asp Gly His Phe His Leu He Asp Gly 
100 .105 HO 

Asp Lys lie Ala Thr Lau He Ser Ser Phe Leu Lys Glu Leu Leu Val 
115 • ' 120 125 

Glu He Gly Glu Ser Leu Asn He Gly Val Val Gin Thr Ala Tyr Ala 
130 135 140 

Asn Gly Ser Ser Thr Arg Tyr Leu Glu Glu Val Met Lys Val Pro Val 
i45 150 "155 160 

Tyr Cys Thr Lys Thr Gly Val Lys His Leu Kis His Lys Ala Gin Glu 
165 170 175 

Phe Asp He Gly Val Tyr Phe Glu Ala Asn Gly His Gly Thr Ala Leu, 
180 ' 185 190 

Phe Ser Thr Ala Val Glu Met Lys He Lys Gin Ser Ala Glu Glti Leu 
195 ■ ! , 200 ( 205 

Glu Asp Lys Lys. Arg Lys" Ala Ala Lys Met Leu Glu Asn He He Asp ' 
210 215 220 

Leu Phe Asn Gin Ala Ala Gly Asp Ala He Ser Asp Met Leu Val lie 

225 ■ • 230 ' . . 235 240 
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Glu Ala lie Leu Ala Leu' Lys Gly Leu Thr Val Gin Gin Trp Asp Ala 
■ 245 ', ' ' -250 255 ' 

Leu Tyr Thr Asp Leu Pro Asn Axg Gin Leu , Lys Val Gin Val Ala Asp 
.260 265 . 270 

Arg Arg Val lie Ser Thr Thr Xaa Ala : Glu Arg Gin Ala Val Thr Pro 
275 ^ . 280 , 285 

Pro Gly Leu Gin Glu Ala lie Asn Asp Leu Val Lys Lys Tyr- Lys. Leu 
290 295 i: : 300 - * 

Ser Arg Ala Phe Val Arg Pro Ser Gly Thr Glu Asp Val Val Arg Val 
305 310 - ■ . 315' 320 

Tyr Ala Glu Ala Asp Ser Gin Glu Ser Ala Asp His Leu Ala His Glu 
325 , ' 330 335 

Val Ser Leu Ala Val Phe Gin Leu Ala Gly Gly He Gly Git Arg Pro 
340 345 3S0 

Gin Pro Gly Phe , 
3 55 



<210> 252 * 

<2ii>. 26 ' ' , 

<212> PRT \ • - ' • ' 

<213> Homo sapiens , ' 

<400> 252 

Leu Ser Lys Ala Phe Leu Asp Ser Pro Asn Arc' Leu Leu Ala Val Glu 
1 - 5 10 • • " • 15 

Met Asn Thr Asp His Leu Arg Leu Thr Val 
20 25 



<210> 253 ' '■ , ' 

<211> .28 ' ■ - , 

<212> PRT 

<213> Homo sapiens ' - ■ J 

<220> ' ' • ( • 

<221> SITE 
<222> (11) 

<223> Xaa equals any of the naturally occurring L-acuno acids 
<400> '253 , / 

Pro Asn Gly lie Gly Ala Leu Lys Leu Arg Xaa Met Glu His Tyr Phe- 
1 5 '• • 10 . '. 15 

Ser Gin Gly Leu Ser Val Gin Leu Phe Ash Asp Gly . ' - 

" 20 25 



<2i0> 254 
<211> 28 
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<212> PRT 

<213> Homo sapiens 1 ■; 

<400> 2 54 i ' . ■ . 

Ser Lys Gly Lys Leu Asn His Leu' Cys Gly Ala. Asp Phe Val Lys Ser 
1 - 5 10 : 15 

His Gin Lys Pro Pro Gin Gly Met Glu lie Lys Ser' 
20 • 25 



<210> 255 

<211> 28 1 

<212> PRT .- ; • 

t <213> Homo sapiens 

<400> 255 •' 
Asn Glu Arg Cys Cys Ser Phe Asp Gly Asp, Ala Asp, Arg He Val Tyr 
1 5 10 15 

Tyr. Tyr His Asp Ala Asp' Gly His Phe.. His Leu He 
20 . : 25 



<210> 256 ' - ■ i 

<211> 28 
<212> PRT 

<213> Homo sapiens 1 
<400> 256 

Asp Gly Asp Lys tie Ala Thr Leu lie Ser Ser Phe Leu Lys Glu Leu 
} 5 10 15 

Leu Val Glu He Gly Glu Ser Leu Asn lie Gly Val * " ' 
,20 ' 25 



<210>'257 * - 

<211> 28 ' . 
<212> ?RT 

<213> Homo sapiens 
<400> 257 , 

Val Gin Thr Ala Tyr Ala Asn Gly Ser Ser Thr Arg Tyr Leu Glu Glu 
1 5 10 • • 15 

Val Met Lys Val Pro Val. Tyr Cys Thr Lys Thr Gly 

20 25 ' 



<210> 256 
<211> 2 8 
<212>PRT 

<213> Komo sapiens 



<400> 258 , 

Val Lys His Leu His His Lys Ala Gin Glu, Phe Asp. lie Gly Val Tyr. 
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Phe Glu Ala Asn Gly His Gly Thr Aia Leu Phe Ser 
20 25 



■ <210> 259 
<211> 28 
<212> PRT 

<213> Homo sapiens 

<400> ,259. . 

Thr Ala Val Gin Met Lys Ile-Lys Gin Ser Ala Glu Gin Leu Glu Asp 
1 , ,'-5 10 ' 15 

Lys Lys Arg Lys Ala Ala , Lys Met Leu. Glu Asn lie 

■ 20; • "\ • 25 ^ ■■' 



<210> 260 . 
<211> 28 
<212> PRT 

<213> Homo sapiens i * 

<400> 260 

He Asp Leu Phe Asn Gin Ala Ala Gly, Asp Ala He Ser Asp Met Leu 
1 . . ' . 5 • .10 15 

Val lie Glu Ala lie Leu Ala Leu Lys Gly Leu Thr 
.20 25 



<210>-261 ■■ 

<211> 28 ' . 

<212> PRT . 

«;213> Homo sapiens , 

<400>261 « ' ■ ■ 

Val Gin C In Trp Asp Ala Leu Tyr Thr Asp Leu Pro Asn Arg Gin Leu 
1 5 10 . IS 

Lys : Val Gin Val Ala Asp Arg Arg Val He Ser Thr 
20 25 

<210> 262 
<211> 28 
<212> PRT 

<213> Homo sapiens 

<220> • 
<221> SITE 

<222> (2) ■ , ■ 

<223> Kaa equals any of the naturally occurring L-amino acids 
<4U0>- 262 . , . : 

Thr Xaa Ala Glu Arg Gin Ala Val Thr Pro Pro Gly Leu Gin Glu Ala 
1 - • • 5 10 15 



*le Asn Asp Leu Val Lys Lys Tyr Lys Leu Ser Arg 
20 25 
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' <210> 263 --<•... 
<2il> 24* 
<212>* PRT 

<213> Homo sapiens 
<400> 2 63 

Ala Phe Val Arg Pro Ser Gly Thr Glu Asp Val Val Arg. Val Tyr Ala 
' 1 5 10 15 

Glu Ala Asp Ser Gin Glu Ser! Ala ' 

20 ' ■■ *■ 



<210> 264 
<211> 26 

<212> PRT ' 
<213> Homo sapiens ' . " 

<400> 264 

1 Asp His Leu Ala His Glu Val Ser Leu Ala Val Phe Gin Leu Ala Gly 
1 5 . 10 is 

Gly lie Gly Glu Arg Pro Gin Pro Gly Phe 
20 25 



■ <210> 265 

. <211> 443 

<212> PRT - 

<213> Homo sapiens 

<400> 265 ■ ■ 1 . 

Gly Thr Arg .Ala Ala Pro Gly Leu Gly Ala Trp* Gly Arg Arg Ser Pro 
1 5 10 f is 

Pro Ser Phe Ser Pro Pro J Arg' Pro Arg Arg Pre Gly Val Met Ala Gly 
20 25 30 

Leu Asn Cys Gly Val Ser He Ala Leu Leu Gly Val Leu Leu Leu Gly * 
35 , 40 45 

Ala Ala Arg Leu Fro Arg Cly Ala Glu Ala Phe Glu He Ala Leu Pro 
50 55 ' 60 

Arg Glu Ser Asn lie Thr .Vai, Leu He Ly3 Leu Gly Thr Pro Thr Leu 
65 70 , 75 80 

Leu Ala Lys Pro Cys Tyr lie Val He Ser Lys Arg His rle Thr Met 
' . 85 90 95 

Leu Ser lie Lys Ser Gly Glu Arg He Val Phe Thr Phe 'Ser Cys Gin 
100 105 ' .110 < 

Ser Pro Gin Asn His Phe Val lie Glu He Gin Ly3 Asn' Tie Asp Cys • 
US .120 125 

Met Ser Gly Pro Cys Pro Phe Gly Glu Val Gin Leu Gin Pro Ser Thr 
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130 135 140 

Ser Leu Leu Pro Thr Leu. Asn Arg Thr Phe lie Trp Aso Val Lys Ala 
145 150 f 155 • • * i 6 o 

His Lys Ser He Gly Leu Glu Leu Glh Phe Ser Tie Pro Arg Leu Arg . ' 

165 . 170 . 175 

Gin lie Gly- Pro- Gly Glu Ser Cys Pro Asp Gly Val Thr His Ser lie 
180 . ., 185 ' 190 

Ser Gly Arg lie Asp Ala Thr Val Val Arg He Gly Thr Phe Cys Ser 
"' 195 200 *. ■ 205 

Asn Gly Thr Val Ser Arg lie Lys Met Glh Glu Gly Val Lys Met Ala 
210 215 ' 220 

Leu His Leu Pro Trp Phe His Pro Arg Asn Val Ser Gly Phe Ser ~le 
225 , 2.30 235 . 240 

Ala Asn Arg Ser Ser He Lys Arg Leu Cys lie lie Glu Ser Val Phe 
245 ■ ■ 250 . ., 255 

Glu Gly Glu' Gly Ser Ala Thr Leu Met Ser Ala Ash Tyr Pro Glu Gly 
260 265 270 

Phe Pro Glu Asp Glu Leu Met Thr Trp Gin Phe Val, Val Pro Ala, His 
275 • , 280 285 

Leu Arg Ala Ser Val Ser Phe Leu Asn. Phe Asn Leu Ser Asn Cys Glu 
290- ' 295 .300 . 

Arg Lys Glu Glu Arg Val Glu Tyr Tyr lie Pro Gly Ser Thr Thr Asn 
305, 3io 315 320 

Pro Glu Val Phe Lys Leu Glu Asp Lys Gin Pro Gly Asn Met Ala Gly 

325 330 .'; 335 

Asn Phe Asn Leu ser Leu Gin Gly Cys Asp Gin Asp Ala Gin Ser Pro 
.340 345 350 

Gly Tie Leu Arg Leu Gin Phe Gin Val Leu Val Gin His Pro Gin Asn 

355 360 365 ' ' 

Glu Ser Asn Lys He Tyr Val Val Asp Leu Ser Asn Glu Arg Ala Met 
370 -375 380 

.Ser Leu Thr lie Glu Pro Arg Pro Val Lys Gin Ser Arg Lys Phe Val 
385 390 , 395 400 

Pro Gly Cys Phe Val Cys Leu Glu Ser Arg Thr Cyc Scr Ser Asn Leu 
405 41C* 415 

Thr Leu Thr Ser Gly Ser Lys His Lys lie Ser Phe Leu Cys Asp Asp 
420 425 430 

Leu Thr Arg Leu Trp Met Asn Val Glu Lys Pro ' ' 

435 440 • • 
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<210> 266 . . * 

<211> 159 
<212> PRT 

<213> Homo sapiens 

, <400> 266 ' 
Phe Glu lie Ala Leu Pro , Arg Glu Ser Asn lie Thr Val Leu lie Lys 
1 5 10 . 15 

Leu Gly Thx Pro Thr Leu Leu Ala Lys Pro Cys Tyr He Val lie Ser 
20 25 30 . 

Lys Arg His He Thr Met Leu Ser lie. Lys Ser Gly Glu Arg He Val 
35 - 40. . 45 

Phe Thr Phe Ser Cys Gin Ser Pro Glu Asn His Phe Val He Glu He 
50 55 60 

Gin Lys Asn He Asp Cys Met Ser Gly Pro Cys Pro Phe Gly Glu Val 
65 70 75 - 80 

Gin Leu Gin Pro Ser Thr 1 Ser- Leu Leu Pro Thr Leu Asn Arg Thr Phe 
85 90 95 

He Trp Asp Val Lys Ala His Lys Ser lie Gly Leu Glu Leu Gin Phe 
100 ' 105 110 

Ser He Pro Arg Leu Arg Gin, lie Gly Pro Gly Glu Ser Cys Pro Asp 
115 120 .' 125 

Gly Val Thr His Ser 'lie Ser Gly Arg He Asp Ala Thr Val Val Arg 
130 135 140 

He Gly Thr Phe Cys Ser Asn Gly Thr Val Ser Arg He Lys Met 
145 150 155 



<21Q> 267 '*.'.., 
<211> 9 

<212> PRT ( ... 

<213> Homo sapiens 

<;400> 267 

Phe Val Arg Asp Pro Phe Val Arg Leu 

1 S : 

<210> 268 
<211> 13 
<212> PRT ' 
<213> Homo sapiens 

<400> 268 

Phe Leu Phe Val Arg Asp Pro Phe Val Arg Leu He Ser 
1 5 10 



<210> 269 
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<211> 15 , 
<212> PRT _ , 

<213>. p Homo sapiens - 
<4O0> 269 

?he Leu Phe Val Arg Asp Pro Phe Val Arg Leu lie Ser Ala Phe 
1 5 10. 15 



. <210> 270 

<211> 380 ■ - , ' 

<212> PRT . ■ / . . " 

<213> Homo sapiens. \ t ' . 1 

<400> 270 ■ ' ' . - ■ 

Tyr Leu Kis Thr Ser Phe Ser Arg Pro His Thr Gly Pro Pro Leu Pro 
1 5 ' 10 15 

; Thr Pro Gly Pro Asp Arg Asp Arg Glu Leu Thr Ala Asp Ser Asp Val 
20 25 30 

Asp Glu Phe Leu Asp Ly3 Phe Leu Ser Ala Gly Val Lys Gin Ser Asp 
35 • 40 '• 4b 

Leu Pro Arg Lys Glu Thr Glu Glh Pro Pro Ala Pro Gly Ser Met Glu 
" 50 55 60 

Glu Asn Val Arg Gly Tyr Asp Trp Ser Pro Arg Asp Ala Arg Arg Ser ■ 
65 • 70:' 75 80 

Pro Asp Gin Gly Arg Gin Gin Ala Glu Arg Arg Ser Val Leu Arg Gly 
85 < 90 95 

Phe Cys Ala Asn Ser Ser Leu' Ala Phe Pro Thr Lys Glu Arg Ala Phe' 
• 100 . -105 110 

Asp Asp He Pro Asn Ser Glu Leu Ser His Leu lie Val Asp Asp Arg 
• 115 , 120 . .. 125 

His Gly Ala lie Tyr Cys Tyr Val Pro Lys Val Ala Cys, Thr Asa Trp' 
"130 v ,135 140 • 

Lys Arg Val Met lie Val Leu Ser Gly Ser Leu Leu His Arg Gly. Ala 
145 ^ ISO 155 160 

Pro Tyr Arg Asp Pro Leu Arg lie Pro Arg Giu His Val.Hio Asn Ala 
165 170 175 

Ser Ala His Leu Thr Phe Asn Lys Phe Trp Arg Arg Tyr Gly Lys Leu 

• . . . 180 t ■ 185 ' 190 

.Ser Arg His Leu Ket Lys Val Lys Leu Lys Lys Tyr Thr Lys Phs Leu 

195 _ .. ^' - 200 . 205 ■ 

' * Phe Val Arg Asp Pro Phe Val Arg Leu lie Ser Ala Phe Arg Ser Lys 
210 . 215 220 

Phe Glu Leu Glu Asn Glu Glu Phe Tyr Arg Lys Phe Ala Val Pro Met 
225 . 230 235 240 
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Leu Arg Leu Tyr Ala Asn His Thr Ser Leu Pro Ala Ser Ala Arg Glu 

' 245 : ' / 250 ' ^ 255 

Ala Phe Arg Ala Gly Leu Lys Val Ser Phe Ala Asn Phe lie Gin- Tyr 
.' ' 260 265 270 • 

Leu Leu Asp Pro His Thr Glu Lys Leu Ala Pro Phe Asn Glu His.: Trp 
275 '' 280 ■ • 235 • 

Arg Gin Val 'Tyr Arg Leu> Cys His Pro Cys Gin lie Asp Tyr Asp Phe 
290 .295 -300 

Val Gly Lys Leu Glu Thr, Leu Asp Glu. Asp Ala Ala Gin Leu Leu Gin 
305 - 310 315 320 

Leu Leu Gin Val Asp Arg Gin. Leu Arg Phe Pro Pro Ser Tyr, Arg Asn ' 
325 ■ 330 .' 335 

"Arg Thr Ala Ser Ser Trp Glu ,Glu Asp Trp Phe' Ala Lys Tie Pro ^eu 
340 ■ » ^ 345 350 

Ala Trp Arg Gin Glri Leu Tyr' Lys Leu Tyr Glu Ala Asp Phe Val Leu 
3 55 360 , ■ 365 

Phe Gly Tyr Pro " Lys Pro Glu Asn Leu Leu , Arg Asp 
370 375 380 

<210> 271 - 

<211> 274 

<212> PRT 

<213> Homo sapiens 

<400> 271 

Lys Leu ! Val Arg Leu Gin Val- ; Pro .Val Arg' Asn Ser Arg Val Asp Pro 
1 5 10 ' 15 

Arg val Arg Ser Lys lie Gly Ser Arg Arg Trp Met Leu Gin Leu lie ' 
20 .25 30 

Met Gin Leu Gly Ser Val Leu Leu' Thr Arg Cys Pro Phe Trp Gly Cys 
35 40 45 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg -Ala Glu Ala Arg Arg Lys 
50 ( 55 ' 60 ' 

Pro Asp t lie pro val.. 'pro Tyr Leu Tyr ?he Asp Met Gly Ala Ala Val 
'65 70 ( ■ ,75 80 ' 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 

85 ' :' 90 " 95 

Leu Gly Ala Ala Leu Gin Leu Ala lie Ser, Thr* Tyr Ala Ala Tyr Ik 
' 100 105 110 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 
115 . 120 125 
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Arg Thr Val Ala He He Gly Giy Phe Leu Val Leu Ala Ser Gly Ala 
: 130 ■ 135 . 140 

Gly Glu Leu Tyr Arg' Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
145 150 - 155 160 

Gly Gin Val Phe Leu Gly He Tyr Leu lie Cys Val Ala Tyr Ser Leu- 
1G5 170 , 175 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
180 185 190 

'Gly Glu Leu Met He Gin Leu Phe Phe Val Leu Tyr Gly lie Leu Ala 
195 ■ ' 200 205 

Leu Ala Phe Leu Ser Gly Tyr Tyr Val Thr Leu Ala Ala Gin He Leu 
210 ' 215 220 

Ala Val- Leu Leu Pro Pro Val Met Leu Leu lie Asp Gly' Asn -Val Ala 
225 , 230 235 240 

Tyr Trp His Asn Thr Arg Arg Val Glu Phe Trp Asn Gin Met Lys 1 Leu 
245 , 250 255 

Leu Gly Glu Ser Val Gly He Phe Gly Thr Ala Val lie Leu Ala Thr 
260. . 265 270 

Asp Gly 



• <210> 272 . . . 

• <211> 203 •„ ' . 
<212> PRT 

<213=» Homo sapiens* . 

<400> 272 . . 

Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Trp Gly cys 
1 5 10 15 • 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 
20 25 30 

Pro Asp lie Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 

35 . , 40 ' ' ~ ,45 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
50 55 ■ 60 

Leu Gly. Ala Ala Leu Gin Leu Ala He Ser Thr Tyr Ala Ala Tyr lie 
65 70 '75 80 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 
. 65 - • 90 95 

Arg Thr Val Ala Ho He Gly Cly Phe Leu Val Leu Ala Ser Gly Ala 
100 , 105 HO 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
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115, 120 125 

Gly Gin Val Phe Leu Gly lie Tyr Leu lie Cys Val Ala Tyr Se- Leu 
130 •. ' 135 , 140 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
. 145 ' 150 155 160 

,Gly Glu Leu Met He Gin Leu Phe Phe Val Leu Tyr Gly He Leu Ala 
155 170 175 

Pro Gly Leu Ser Val, Arg Leu Leu Arg Asp Pro Argr Cys Pro Asp Pro 
180 185 190 

Gly Cys Thr Ala. Ala Pro Cys His Ala Ala His 
195 200 



<210> 273 
<211> 407 
<212> PRT ' 
- <213> Homo sapiens 

. <400> 273 

Ser, Asn Glu '"lie Leu Leu Ser Phe Pro Gin Asn Tyr Tyr He Gin Trp 
1 5 .10 15 

Leu Asn Gly Ser Leu .He His Gly Leu Trp Asn Leu Ala Ser Leu Phe 
■ 20 .25 30 

Ser Asn Leu Cys Leu Phe Val Leu Met Pro Phe Ala Phe Phe Phe Lou 
35 -40 ' * 45 

Glu Ser Glu ? Gly ' Phe Ala Gly Leu Lys Lys Gly He- Arg Ala Arg lie 
50 ' 55 60 

Leu Glu Thr. Leu Val Mat Leu Leu Leu Leu Ala Leu Leu lie Leu Gly 

* 65 ' 70 75 80 

lie Val Trp Val Ala Ser Ala Leu He Asp Asr. Asp Ala Ala Ser Met 
85 90 95 

Glu Ser Leu Tyr Asp Leu Trp Glu Phe Tyr Leu Pro Tyr Leu Tyr Ser 
100 105 HO 

Cys lie Ser Leu Met Gly Cys Leu Leu Leu Leu Leu Cys Thr Pro Val 
115 120 125, 

Gly Leu Ser Arg Met Phe Thr Val Met Gly His Leu Leu Val Lys Pro 
. 130 . , . 135 140 

Thr lie Leu Clu Asp Leu Asp Glu Gin lie Tyr He He Thr Leu Glu 
145 - , 150 155 160 

Glu Glu Ala Leu Gin Arg Arg Leu Asn Gly leu Ser Ser Ser Val Glu 
1S5 170 175 

Tyr Asn He Met Glu Leu Glu Gin Glu Leu Glu Asn Val Lys Thr Leu 

-■■ . . *" , 180 1 185 190 
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Lys Thr Lys Leu Glu Arg Arg Lys Lys Ala Ser Ala Trp Glu Arg Asn 
.195 ' 200 205 • 

Leu Val Tyr Pro Ala. Val Met Val Leu Leu Leu lie Glu Thr Ser lie ■ 

210 • . 215 * , 220 

Scr Val Leu Leu val Ala' Cys Asn lie Leu Cys Leu Leu Val Asp Glu v 
225 : 230 235 240 ■ 

Thr Ala Met Pro Lys Gly Thr Arg Gly Pro Gly lie Gly Asn Ala Ser 
245 _ ' 250 ■ 255 

Leu Ser Thr Phe Gly Phe Val Gly Ala Ala Leu Glu lie He Leu lie 
260 " 265 270 

Phe Tyr Leu Met Val Ser Sar Val Val Gly Phe Tyr Ser Leu Arg Phe 
.275 280 285 • . 

Phe Gly Asn Phe Thr Pro Lys Lys Asp Asp Thr Thr Met Thr I.y.s' He V 
290 295 , 300 ' ' 

He Gly Asn Cys Val Ser lie Leu Val Leu Ser* Ser Ala Leu Pro Val 
305 310 , 315 320 

Met Ser Arg Thr Leu Glvr lie Thr Arg Phe Asp Leu Leu Gly Asp Phe 
325 330 ' ^335 

Gly Axg Phe Asn Trp Leu Gly Asn Phe' Tyr He Val Leu Ser Tyr Asn 
340 345 350 

Leu Leu Phe Ala He Val Thr Thr Leu Cys Leu, Val Arg Lys Phe Thr r 
355 360 365 

Ser Ala Val Arg Glu Glu Leu Phe Lys Ala Leu Gly Leu His Lys Leu 
370', 375 380 

His Leu Pro Asri Thr Ser Arg Asp Ser Glu Thr Ala 1 Lys Pro Ser Val 
385 390 395 400 

Asn Gly His Glh Lys Ala Leu 

405 ' ■ . . ' ' • ■ • 



<210> 274 , 
<211> 165 
<212> PRT 

< 2 1 3 > Homo sapiens i' t. 

<400> 274,--.-, 

Axg Ser Tyr Met Cln Ser Val Trp Thr t Glu Clu Ser Gin Cys Thr Leu 

1 ;" 5 1 10 t 15 

Leu Asn Ala Ser lie Thr Glu Thr Phe Asn Cys Ser Phe Ser Cys Gly 
20 25 30 



Pro Asp Cys Trp Lys Leu Ser Gin Tyr Pro Cys Leu Gin Val Tyr Val 
35 . 40 45 
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Asn Leu. Thr Ser Ser Gly Glu Lys Leu Leu Leu*Tyr His Thr Glu Glu . 
50 55 60 * 

Thr lie Lys He. Asn Gin Lys Cys Ser Tyr He Pro Lys Cys Gly Lys ' 
65 70 75 80 ( 

Asn Phe Glu Glu Ser Met Ser Leu' Val Asn Val Val Met Glu Asn Phe 
85 90 - 95 

Arg Lys Tyr Gin His Phe Ser Cys Tyr - Ser Asp Pro Glu Gly Asn 31 n , 
100 ' 105 110 

Lys Ser Val lie Leu Thr Lys Leu Tyr Ser Ser Asn Val Leu Phe His 
115. ( ' 120 . 125 , 

Ser Leu Phe Trp Pro Thr Cys Met Met Ala Gly, Gly Val Ala He Val 
130 135 140 

Ala Met Val Lys Leu Thr Gin Tyr Leu Ser Leu Leu Cys t Glu Arg lie 
145 , ■ ,150 • 155 ' 160 

Gin; Arg rle Asn Arg ^ 
165 



<210> 275 I . 
<211> 155 > 
<212> PRT 
<213> Homo sapiens 

<4p0> 275 

Ala Phe Ala His Leu Gin Leu Gly Pro Met Trp Lys Leu Trp Arg Ala 
.1 • 5 10 15 

Glu Glu Gly Ala Ala Ala Leu Gly Gly Ala Leu Phe Leu Leu Leu Phe 

20 • / 25 ■ 30 

Ala Leu Gly Val Arg Gin Leu Leu Lys Glh'Arg Arg Pro Met Gly Phe 
35' ; 40 . 45 

Pro Pro Gly Pro Pro Gly Leu Pro Phe He Gly Asn He Tyr Ser Leu 
'50 55 60 

Ala Ala Ser Ser Glu Leu Pro His Val Tyr Met Arg Lys Gin Ser Glh 
55 70 . 75 80 

Val Tyr Gly Glu Val Gin Pro Arg Arg Ala Pro Gly Arg Glu Gly Arg 
8b 90 95 

(' 

Gin Ala Gly Pro Gly Trp Pro Gly Pro Ser Trp Leu Asp Leu Trp Pro 
100 105 110 

Pro Leu Gly Arg Leu Val Gly Thr Ser ' Pro Cys Ala Gly Cys Pro Leu 
115 120 125 

Arg Aap Thr Arg Phe Pro Gly Leu Glu Gly Arg Ser Pro Arg Arg Arg 
130 135 140 

Ala Pro Leu Gin Gly Glu Pro Arg Pro Cys Arg 
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145 150 155 



<210> 276 

<211> 42, ■ . 

<212> PRT - 
<213> Homo sapiens 

<400> 276 \ 

Met Axg Val Arg lie Gly Leu Thr Leu Leu Leu Cys* Ala Val Leu Leu 

1 5 * 1G ■ 15 

Ser Leu Ala iser Ala ser ser Asp Glu Glu Gly ser Gin Asp Glu ser 
20 25 • 30 



Leu Gly Phe Gin ■ Asp Tyr Phe Asp lie Arg 
35 ' 40 



<21G> 277 
.'<211> 155 
<212> PRT . 

<213> Homo sapiens ■ , 

<400> 277 ' ■ . 

Met Ala Arg Gly Ser Leu Arg Arg Leu Leu Arg Leu Leu Val Leu Gly 
1 5 10 15 

Leu Trp Leu Ala Leu Leu. Arg Ser Val Ala Gly Glu Gin Ala' Pro Gly 
20 25 ,\ 30 

Thr Ala , Pro Cys Ser Arg Gly Ser Ser Trp Ser Ala Asp! Leu Asp Lys 
35 40 45 ' 

Cys Met Asp Cys Ser Thr. Ser Cys Pro Leu Pro Ala Ala Leu Ala His 

50 ' ' 55 f 60 ' ' '• 

Pro /Trp Gly Arg Ser Glu Pro Asp Leu Arg Ala Gly Ala; Ala Phe Trp 
65 70 75 < , 80, 

Leu Phe Gly Leu Glu Thr' Met Pro Gin Glu Arg Glu Val His His Pro 
85 90 95 

His Arg' Gly Asp Arg Arg Arg Gly Leu Pro Ser Cys Gly Ala Asp Pro 
100 105' 110 

Val Thr Met Cys Pro Leu Pro Ala Gly Ala Arg Pro Leu He lie His 
115, ,"• 120 .'. 125 

Ser Ser lie Leu Glu Pro Val Ser Ala Ser Gin Thr Arg Arg Glu Pro 
130 . 135 . 140 

Ser Ser Ser Asn His Lys Gly Gly Gly Gly Arc 
145 150 155 



<210> 278 
<211> 207 
<212> .PRT 
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<213> Homo sapiens 
<400> 278 

Gly Thr Ser Phe Leu Asp Pro, Thr Leu Ser Leu Phe Val Leu Glu Lys 
1 5 10 . . 15 

Phe Asn Leu Pro Ala Gly Tyr Val Gly Leu Val Phe Leu Gly Met Ala 
20' 25 30 • 

Leu Ser, Tyr Ala lie Ser Ser Pro Leu Phe Gly Leu Leu Ser Asp Lys 
. 35 40 . 45 

Arg Pro Pro Leu Arg Lys Trp- Leu Leu Val Phe Gly Asn Lou He Thr 
50 55 60 

Ala Gly Cys Tyr Met Leu Leu Gly Pro Val Pro lie Leu His lie Lys 
65 . 70 75 , . 80 

Ser Gin Leu Trp Leu Leu Val Leu He Leu Val Val Ser' Gly Leu Ser . 
• 85 90 95 ' 

Ala Cly Met Ser lie He Pro Thr Phe Pro Glu He Leu Ser Cys Ala 
100 ■ 105 110 

', His Glu Asn Gly .Phe Glu Glu Gly Leu Ser Thr Leu Gly Leu Val Ser 
115 120 • 125 

Gly Leu Phe Ser Ala Met Trp . Ser lie Gly Ala Phe Met Gly Pro Thr 
130 . " . * 135 140 ' 

Leu Gly Gly Phe Leu Tyr Glu Lys He Gly Phe Glu Trp Ala Ala Ala 
145 - 150 155 160 

He Glh Gly Leu .Trp Ala Leu He Ser Gly Leu Ala Met Gly Leu Phe 
165 170 175 

Tyr Leu Leu Glu Tyr Ser Arg Arg Lys Arg Ser Lys Ser Gin Asn He 
180 . ' 185 190 

Leu Ser Thr Glu Glu Glu Arg Thr Thr Leu Leu Pro* Asn Glu Thr. 
195 200 205 



<210> 279 

<211> 85 ' 
<212> PRT , ' . 

<213> Homo sapiens ' , 
<400> 279 * . 

Gly Thr. Arg Glu Ala Arg Leu Arg Asp Leu Thr' Arg Phe Tyr Asp Lys 
1 5 ' 10 15 

Val Leu Ser Leu His Glu Asp Ser Thr Thr Pro Val Ala Asn Pro Leu 
20 , 25 . 30 

Leu Ala Phe Thr Leu lie Lys Arg Leu Gin, Ser Asp Trp Arg Asn Veil 
•35 40 45 



Val His Ser Leu Glu Ala Ser Glu Asn He Arg Ala Leu Lys Asp Gly 
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SO 55 60 

Tyr Glu Lys Val Glu Gin Asp Leu Pro Ala Phe Glu Asp Leu Glu Gly 
65 70 75 80 

Ala Ala Arg Ala Leu ' 

85 • . . 



<210> 280 

<211> 7 

<212> PRT 

- <213> Homo sapiens 

<400> 280 
' Ala Leu Met Arg Leu Gin Asp 
' 1 5 ' 



<210> 281 
<211> 7 
<212> PRT \ ' 
<213> Homo sapiens 

<400> 281 

Val Glu Ala Gly Gly Ala Thr 
1 5 



<210> 282 
<211> 489 
<212> PRT 

<213> Homo sapiens ' ■ 

<400> 282 • ' . ' 

Gly thr 'Arg Glu Ala Arg Leu Arg Asp Leu Thr Arg Phe Tyr Asp Lys 

1 .. 5 . ' 10 , 15 

Val Leu Ser Leu -His Glu Asp Ser Thr Thr Pro Val Ala Asn Pro Leu 
20 25 < 30 

Lau Ala Phe Thr Leu He Lys Arg Leu Gin Ser Asp Trp Arg Asn Val 
35 40 ' 45 

Val His Ser Leu Glu Ala Ser Glu Asn He Arg Ala Leu Lys Asp Gly 
'50 55 60 

Tyr Glu Lys val Glu Gin Asp. Leu Pro Ala Phe tilu -Asp Leu tilu Gly 
65 ■ 70 ' 75 . 80' 

Ala Ala Arg Ala Leu Met Arg Leu Gin Asp Val Tyr Met Leu Asn Val 
85 ' 90 ' 95 

Lys Gly Leu Ala Arg Gly Val. Phe Gin Arq Val Thr Gly Ser Ala He 
100 105 110 



Thr Asp Leu Tyr Ser Pro Lys Arg Leu Phe Ser * Leu Thr Gly Asp Asp 
115 120 . 125 
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Cys Phe Gin Val Gly Lys Val Ala Tyr Asp Met Gly Asp Tyr Tyr His 
130 , 135 140 

Ala 'lie Pro Trp Leu Glu Glu Ala Val Ser Leu Phe Arg Gly Ser Tyr 
145 150 155 160 

Gly .Glu Trp Lys Thr Glu Asp Glu Ala Ser Leu Glu Asp Ala Leu Asp 
165 . ' 170 17b 

His Leu Ala Phe Ala Tyr Phe Arg Ala Gly'Asn Val Ser Cys Ala Leu 
190 185 190 

Ser Leu Ser Arg Glu Phe Leu Leu Tyr Ser Pro Asp Asn Lys Arg Met 
,195 200 205 

Ala Arg Ash Val Leu Lys' Tyr Glu Arg Leu Lou Ala Glu Ser Pro Asn 
'210 215 220 

His Val Val Ala Glu Ala Val lie Gin Arg Pro Asn He Pro His Leu 

225 ; 230 ' 235 \ 240 

Gin Thr Arg A3p Thr Tyr Glu .Gly Leu Cys Gin Thr Leu Gly Ser Gin 
245 ,250 255 

Pro Thr Leu Tyr Gin He Pro Ser Leu Tyr Cys Ser Tyr Glu Thr Asn 
260 ■ . * 265 270 

Ser Asn Ala Tyr Leu Leu Leu Gin Pro He Arg Lys Glu Val lie His 
275 280 285 

Leu Gin Pro Tyr He Ala Leu Tyr. His Asp Phe Val Sor- Asp Ser Glu 
290 , 295 300 

Ala Gin Lys He Arg Glu Leu Ala. Glu Pro Trp Leu Gin Arg Ser Val 
305 310 .315 .320 

Val Ala Ser -Gly. Glu Lys Gin Leu Gin Veil Glu Tyr Arg lie Ser Lys 

\ 325 ; ; ■ ■ 330 335 

Ser Ala Trp Leu Lys Asp Thr Val Asp Leu Lys Leu Val Thr Leu Asn 
340 345 3 50 

His Arg lie Ala Ala Leu Thr Gly Leu Asp Val Arg Pro Pro Tyr Ala 
355 , 360 365 

Glu Tyr Leu Gin Val Val Asn Tyr Gly He Gly Cly His Tyr Glu Pro 
370- • 375 380 

His Phe Asp ! His Ala Thr Ser Pro Ser Ser Pro Leu Tyr Arg Met Lys 
385 ,. 390 395 400 

Ser Gly Asn Arg Val Ala Thr Phe Ket He Tyr Leu ser Ser Val Glu 
. 405 ,410 415 ' 

Ala Gly Gly Ala Thr Ala Phe He Tyr Ala Asn Leu Ser Val Pro Val 
. • 420 425 430 

Val Arg Asn - Ala Ala Leu Phe Trp Trp Asn Leu His Arg Ser Gly Glu 
435 , 440 445 
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Gly Asp Ser Asp Thr Leu His Ala Gly Cys Pro Val Leu- Val G_.y Asp' 
450 455 460 

Lys Tfp Val Ala Asn Lys Trp'lie His Glu Tyr Gly Gin Glu Phe Arg 
465 470 , 475 480 

Arg Pro Cys Ser Ser Ser Pro Glu Asp .. 
485 



<210> 2S3 > ' ' 

<211> 136 ' 
<212> PRT . 

< 2 1 3 > , Homo sapiens . ' 

<400> 283 

lie Gin Pro Ser His Ala Ala Leu Leu His Cys Arg Ser Thr Phe Arg 

1 5 . 10 • •. 15 

Lys Thr Glu Cys Leu Asp Pro Trp Trp Val Arg Arg Gin Leu Leu Gly 

20 w ' • ' 25 30 - v 

Met Ala Gly lie Gly Gly Leu Gin Lys Met Lys Ala Pro His Thr Gly 
- .35 . 40 45 

Val Leu His Leu Gly Ser Val Trp Val The Leu Gly Vro Phe Leu Leu 

. 50 , ^ * • ,.' . 55 '* 60 

Gly Val Gly Tyr Thr Leu Thr Phe Asn Pro Leu Ser Gly Cys Met Ser 
65 70 / 75 80 

Thr Val Arg Trp Leu Asn Ser Asn lie Thr Ala Asn Arg Thr Lea Ser 
85 .90 95 

Arg Ser Val Cys His Val Thr 1 Pro Leu His Arg Ser Leu Ser Pro Mis 
100 105 110 

Asp Gly Glu' Tyr Leu Arg Gin Met Leu Leu Asn Ser Ser Ser Arg Ala 
115 120 125 

Gly Glu Ala Gly Ser Trp Gly Tyr * 
130 / 135 



<210> 284 , 

<211> 96 ' . 

<212> PRT , ' 

<213> Homo. sapiens .. .. % 

<400> 2 84 

Cy3 Ser Ser Pro Pro Gly Arg Leu Pro Trp Cys Trp Tht Ala Pro Arg 

1 5 " ; t ' 10 • . 15 

Thr Leu Gly Lys His Gly Ser Leu He Ser Thr Leu Arg Leu Thr Ala 
20 25 .30 



Pro Leu His Leu Ala Trp Lys Met Met Leu Ser Arg Lys Ala Leu Phe 
35 '40 45 
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Val Leu. Leu Asn Thr Pro Val Leu Phe His Ala Leu Glu Gly Arg Leu 
50 55 60 

Phe Ser Lys Leu Cys His His His Thr He Gin Arg Thr Leu Thr Val 

65 70 ; ! ■ • 75 80 

Pro^Lys Phe Arg Ser Ser " 

' 85 '* - . 



<210> 285' . 
■ <211> 75 • * ' 

<212> PRT 

<213> Homo sapiens 
<400> 235 

Arg Ser Pro Thr Ser Arg Val Gin Leu Leu Lys Arg Gin Ser Cys Pro 

1, 5 ... 10 15 

Cys Gin Arg Asn Asp, Leu Asn Glu Glu Pro Gin. His Phe Thr His Tyr 

r 20 ' ■ ( ■ 25- .30 . 

Ala He Tyr Asp phe He Vai Lys Gly Ser Cys Phe Cys Asn Gly His , 
35 . 40 45 

Ala Asp Gin Cys lie Pro Val Hi3 Gly Phe Arg Pro Val Lys Ala Pro 

50 ' 55/ ' 60 

Gly Thr Phe. His Met Val His Gly Lys Cys Met ' 
65 70 * .75 



<210> 286 .■■«'.' 
<21i> 296 

<212> PRT • 
<213> Homo sapiens 

<400> 286" 

His Asn Thr Ala Gly Ser His Cys Gin His Cys Ala Pro Leu ♦ Tyr Asn 
1 5 . 10 15 

. Asp Arg Pro Trp Glu Ala Ala Asp Gly Lys Thr Gly Ala Pro Asn Glu 
20 . 25 30 

Cys Arg Thr Cys Lys Cys Asn Gly His Ala Asp Thr Cys His Phe Asp 
,35 • / 40 45 

Val Asn . Val Trp Glu Ala Ser Gly Asn Arg Ser .Gly Gly Val Cys Asp 
50 55 60 

Asp Cys Gin His Asn Thr Glu Gly Gin Tyr Cyu Gin Arg cys Lys Pro 
65 70 75 80 

Gly Phe Tyr Arg Asp Leu Arg Arg Pro Phe Ser Ala Pro Asp Ala ' Cys 
85 90. . 95 

Lys Pro cys Ser Cys His Pro Val Gly Ser Ala . Val Leu Pro Ala Ash 
100* ,105 110 
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Ser Val Thr Phe Cys Asp Pro Ser Asn Gly Asp Cys Pro Cys Lys Pro 
.115 . 120 . 125 

Gly Val Ala Gly Arg Arg Cys Asp Arg Cys Met Val Gly Tyr Trp Gly 
130 135 140 

Phe Gly Asp Tyr Gly Cys Arg Pro cys Asp cys Ala Gly ser Cys Asp 
145, t ; ' 150 ',,155' * 160 

Pro Tie Thr Gly Asp Cys lie Ser Ser His Thr Asp. lie Asp Trp Tyr 
165 _ 170 1^5 

His Glu Val Pro Asp Phe Arg Pro Val His Asn Lys Ser Glu Pro Ala 
' 180 ' : 185 _ '190 

Trp Glu Trp Glu Asp Ala Gin Gly -Phe Ser Ala. Leu Leu His Ser Gly 
195 ,200 205 

Lys Cys Glu Cys ' Lys Glu Gin Thr Leu Gly Asn Ala. Lys Ala Phe Cys 
210 215 1 220 ' 

Gly Met Lys Tyr Ser' Tyr Val Leu Lys lie Lys lie Leu Ser Ala His 
225 230 _ 235 240 

Asp Lys Gly Thr His Val Glu val Asn Val Lys lie Lys Lys Val Leu 
- 245 250 255 

Lys Ser Thr Lys Leu Lys lie Phe Arg Gly Lys Ala Asn He He Ser 
260 • • 265 , 270 

Arg lis Met Asp Gly Gin Arg Met His Leu' Ser Asn Pro Gin Ser Trp 
■■ 275 t 280 285 

Phe Gly He Pro ' Cys Ser Arg Thr . 1 

290 295 



<210> 287 • . 

<21i> 37 
<212> PRT 

<213> Homo sapiens ' ' 

<400> 287 ' \ 

Cys Asp Asp Cys Gin His Asn Thr Glu Gly Gin, Tyr Cys Glh Arg Cys 
1 5 •. 10 15 

Lys Pro Gly Phe Tyr Arg Asp Leu Arg Arg Pro Phe Ser Ala Pro Asp 
20 25 " 30 • 

Ala Cys Lys Pro Cy3 
. 35 



<210> 28B 
<211> 36 
<212> PRT 

<213> Homo sapiens 
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<400> 288 

Cys Pro Cys Lys Pro Gly Vai Ala Gly Arc Arg Cys Asp Arg Cys Met 

• ,.1 •• ; 5 ,. ■ -1C 15 . 

Val. Gly Tyr. Trp Gly Phe Gly Asp Tyr t Gly Cys Arg Pro Cys Asp Cys 
20 ' . 25 30 

Ala Gly Ser Cys 

35 * ■ •• ' ' . 



<21Q> 289 \ ^ 

<211> 66 ' - t ' . ■ , ■ 

<212> PRT ' 
<213> Homo sapiens 

<400> 289 

Asn He Ser Ser Gin ,Tyr Cys-lle Leu Lys Ser Leu Glu Met Met "le 
1 5 10 15 

Ser Gly Leu Lys Leu Leu "Val Leu Phe Leu Lys Phe Ala Pro Glu Asn 

20 . ■' 25 ' 30 

Tyr Cys Leu Ser Thr Glu Thr Leu Gin Met Pro Asn Arg His Leu Arg 
,35 40 45 

Leu Scr Lys Ala Thr Cys Tyr Leu Met Lys Cys L*su Leu Pro Ser Tyr 
50 55' 60 

Phe Glu 

65 ' . ' ' 



<210> 290 j , 

<211> 88' 
<212> PRT 

<213> Horrio sapiens ■ ' * 

<400> 290 

Pro He Glu Gly Thr Pro Ala, Gly Thr Gly Pre Glu Phe Pro Gly Arg 
1 " 5' 10 ' 15 ' 

Pro Thr Arg Pro Gin Arg Met Arg Ser Leu He Ser Ser His Pro Cys 
20 25 30 

Gin His Leu 'Leu Leu Leu Leu Leu Leu Leu Phe Lou lie Leu Ala lie 

35 40 . 45 • ■ 

Leu Val Asp Val Lys Trp' Tyr Leu Val Leu Phe lie Cys He Ser Leu 
50 55 60 

Met Thr Ser Asp Val Glu His Leu Phe Met Cys Leu Leu Ala He Arg 
65 , ' - ■ 7 0 . * 75 80 

lie Ser Ser Trp Arg Asn Val Tyr - h 

35 



<210> •291* 



. PCT/US99;i7130 
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<211> 60 « 

<212> PP.T ■ 1 

<213> Homo' sapiens 

<400> 291 ' , " ■ ' *■ 

Asn ,Trp V*l Pro Thr Cys Leu Cys Pro Ser Ala Pro Cys Ser Phc His 
1 1 5 10 15 

Leu Leu Ser Arg Phe Lys Cys Leu Phe Ser Pro Gin Arg Leu Thr Asp 
20 25 30 

lie Phe Arg Arg Tyr Asp Thr Asp Gin Asp Gly Trp lie Gin Val Ser 
35 40 45 

tyr Glu Gin Tyx Leu Ser Met Val Phe Ser He Val 
50 55 , 60 

<210> 292 
<211> 33 

<212> PRT '.*.-.. 
<213> Homo sapiens 

<400> 292 

Gin Arg Leu Thr Asp He Phe Arg Arg Tyr Asp Thr . Asp Gin Asp Gly 
1 . 5 '10 15 

,Trp Ue Gin Val Ser Tyr Glu Gin Tyr Leu Ser Met Val Phe Ser lie 
20 25 30 

Val ' • *' ' ' . 



<210>' 293 
<211> 73 
<212> PRT 

<213> Homo sapiens 

<220> ' • 
<221> SITE ■ 
<222> (38), 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any - or the naturally occurring L -amino acids 
<220> 

<221> SITE- '* ■ * '. - • 

<222> (54) ( ' 

,<223>' Xaa equals any of the naturally occurring L-amino acids 

<220> ' 
<221>. SITE ' 

<222> (55) .' 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<220> 

<221>" SITE 1 .' . . 

<222> (63) 

<223> Xaa equals any of the naturally occurring L-aminb acids 
<400> 293 . . ' 

Met Phe Tyr Lys Leu Thr Leu lie, Leu Cys Glu Leu Ser Val Ala Gly 
1 5 . 10 15 

Val Thr Gin Ala Ala Ser' Gin Arg Pro Leu Gin Arg Leu Pro Arg, His 
20 25 30 

lie Cys Ser Gin Arg Xaa Pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa 
35 40. 45 

Leu Gin Thr Thr Trp Xaa Xaa Pro Asp Lys Pro lie Pro Arg Leu Ser 
50 • , 55 . 60 



Pro Pro= Leu xaa Ser Asp Pro Lys Arg 

65.* 70 



<210> 294 

<211> 95 .- • ' . _ 

<212> PRT 

<213>^Homo sapiens 
<A0Q> 294 . 

Thr Ser Ser Pro Val Phe Ser Phe Cys Ser Met Ala Val Arg Glu Pro 
1 • ' 5 10 15 

Asp His Leu Gin Arg Val Ser Leu Pro Arg Tyr Asn Val Ser Ala Ser 
20 . 25 30 

Leu Gin Trp Leu Pro Cys His Arg lie Val Leu Gin Pro Trp His Met 

35 ' 40 • 1 45 

Cys Ala Met Trp Glu Leu Gly Gin val Leu Phe His Pro, Val Ala Pro 
50' 55. 60 

Arg Glu Gly, -Ala' Ala Pro Ser Pro Val Ser Thr Leu . Thr Trp Pro Ser 
65 70 . 75 80. 

Ser Cys Ser, His Ser Glu Ser Thr Met Glu Leu Glu Leu Gin Phe 

: R5 '90 .95 



<210> 295 
, <2U> 16 
<212>>RT, 
<213> Homo sapiens 



<400> 295 

Met Ala Val Arg Glu Pro Asp His Leu Gin Arg Val Ser Leu Pro Arg 
1 5 10 15 
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<210> 296 
<211> 7 

<212> PRT . ■ 

<213> Homo sapiens 

<400> 296 

Leu Pro Cys His Arg lie Vai 

1 5 • 



<210> 297 '-' - 

<211> 15 • , * '■ 

<212> -PRT 

<213> Homo sapiens 

<400>' 297. ... 

Ser Leu Gin Trp Leu Pro Cys His Arg lie Vai Leu Gin Pro Trp 
1 5 10 15 



<210> 298 
<211> 454 
<212> PRT 
<213> Homo sapiens 



Lys Pro Lys Arg Glu His cys Ser Cys Pro lie Thr 

5 " • 10 ■ ,15 

Tyr Gin Ser Leu Gly Asp. lie Leu Asn Ala Ser Phe Phe Ser Lyc Arg ■ 

20 , . 25 30 

Lys Gly Met Gin Glu Vai Lys Leu Asn Ser Tyr Vai . Vai Ser Gly Thr 

35 f 40 / 45 

lie . Gly Leu Lys Glu Lys Zle Ser Leu Ser Gl'u Pro Vai Phe Leu Thr 

, .50 55 60 



<40"0> 298 
Cys Phe Lys Arg 

'1- ■ 



Phe Arg His Asn Gin Pro Giy Asp Lys Arg Thr Lys His lie Cys Vai 
65 ' . 70 . 75 80 

Tyr Trp Glu Gly Ser Glu Gly Gly Arg Trp Ser Thx Glu Gly Cys Ser 
85, 90 95 

His Vai His Ser Asn Gly Ser Tyr Thr Lys Cys Lyc Cys Phe Hia Leu 
. * 100 105 110 

Ser Ser Phe Ala Vai Lau Vai Ala t Leu Ala Pro Lys Glu Asp Pro Vai 
115 ; 120 125 . 

Leu Thr Vai lie , Thr Gin Vai' Gly Leu" Thr lie Ser' Leu Leu Cys Leu 
'.130 ' ... 135 140 ' 

Phe Leu Ala He Leu Thr Phe Leu Leu Cys Arg Pro lie Gin Asn" Thr 
145 \ 150 . 155 ' 160 

Ser Thr Ser Leu Kis Leu Glu Leu Ser Leu Cys . Leu Phe Leu Ala His 

' 165 . J 17C ' * „ ' 175 
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Leu Leu Phe Leu Thx Gly He Asn Arg Thr Glu Pro Glu Val Leu Cys 
180 185 ,190 

Ser lie. lie Ala Gly Leu Leii His Phe Leu Tyr Leu Ala Cys Phe Thr 
195 ■ 200 , 205 

Trp Ket L2U Leu Glu Gly Leu His Leu Phe Leu Thr Val Arg Asn L.eu 
210 215 , .220 

Lys Val Ala Asn Tyr Thr Ser Thr Gly' Arg Phe Lys Lys Arg Phe Met 

" 225 . . 230 ... ■; '235. 240 

Tyr Pro Val Gly Tyr Gly lie Pro Ala Val lie lie Ala Val Ser Ala 
245 250 . 255 

lie Val Gly Pro Gin. Assi Tyr Gly Thr Phe Thr His Cys Trp Leu Lys 
■ t ■ 260, 265 -270 

Leu Asp Lys Gly Phe Tie Trp Ser Phe Met Gly Pro Val Ala Val lie 
* 275 280 285 

lie Leu lie Asn Leu Val Phe Tyr Phe Gin Val , Leu Trp lie Leu Arg 
290 295 300 

Ser Lys Leu Ser Ser Leu Asn Lys Glu Val Ser Thr lie Gin Asp Thr 
305 ; . 310 315 320 

Arg Val Met Thr Phe Lys Ala lie Ser Gin Leu Phe lie Leu Gly Cys 

. ■ . . 325 ; _ 330 335 

Ser Trp Gly Leu Gly Phe Phe Met Val Glu Glu Val Gly Lys Thr lie 
340 345 3 50 

Gly Ser lie He Ala Tyr "-Ser Phe Thr TMe Tie Asn Thr Levi GVn Gly 
355 .■ . ■ 360 365 . 

i* - 
Val Leu Leu Phe Val Val His 'Cys' Leu Leu Asn Arg Gin Val Arg Met 
370 375 '380 

Glu Tyr Lys Lys Trp Phe Ser Gly Met Arg Lys Gly Val Glu Thr Glu 
385 390 ' 395 400 

Ser Thr Glu Met Ser Arg Ser Thr Thr Gin Thr Lys Thr Glu Glu Val 
.405 . 410 415 . 

Gly Lys Ser Ser Glu He Phe His Lys Gly Gly Thr Ala Ser Ser Ser 
420 425 430 

Ala Glu Ser Thr Lys Gin Pro Gin Pro Gin Val His Leu' Val Ser Ala 
435 440 445 ' 

Ala' Trp Leu Lys Met Asn . 
450 



<210> 299 
<211> 101 
<212> PRT 
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<213> Homo sapiens 
<400> 299 . ' 

Phe Phe Trp Lys Glu Asn Leu Arg Arg Asn' Gly Ser Arg Glu Asp Phe 

1 5 ' , •. .10 15 

Ala Arg Arg Ala .Thr Gin Leu lie Gin Ser Val Glu Leu Ser He Trp 
20 25 30 

Asn Ala* Ser Phe Ala' Ser Pro Gly Lys Gly Gin lie Ser Glu Phe Asp 
35 40 45 

lie val Tyr Glu Thr Lys Arg Cys Asn Glu Thr Arg Glu Asn Ala Phe 
50 • 55 60 

Leu Glu Ala Gly Asn Asn Thr Met Asp lie Asn Cys Ala Asp Ala Leu 

65 .10 75 1 " 80 

Lys Gly Ash Leu Arg Glu Ser Thr Ala Val Ala Leu "Ser Leu lie Asn 
fl= , 90 95 , 

Leu Leu Gly He Phe 
100 



<210> 300 

<21L> 27 

<212> PRT 

<213> Homo sapiens 



*400> 300 ■ _ 

Asp lie Asn Glu Cys Glu Thr Gly Leu Ala Lys Cys Lys Tyr Lys Ala 
1 5 10 15 

Tyr Cys Arg Asn. Lys val Gly Gly Tyr He Cys 

20 ' 25 



<210> 301 

<211> 12 

<212> PRT. , 

<213> Homo sapiens , 

<400> 301 

Cys Arg Asn Lys Val Gly Gly Tyr Tie Cys Ser Cys 
1 5 10 



<210> 302 

<21 1> 331- •' ; • 

<212> PRT , . 

<213> Homo sapiens 

<400> 302 '; 

Ala Leu Cys Pro His Pro His Leu. He Leu Asn Val Thr Val Sor Pro 
1 5 -,10 15 



Ala Pro Ser Cys Arg His Val Lys Lys Val val Ala ser Pro Ser Pro 
20 • 25 30 
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Ser Thr Thr Met lie Ala Met Asp Ala Pro His Ser Lys Ala Ala Leu 
35 40 45 

Asp Ser He Asn Glu Leu Pro Glu Asn lie Leu Leu Glu Leu Phe Thr 
50 • , 55 60 

His Val I'ro Ala Arg Gin Leu Leu Leu Asn Cys Axg Leu Val Cys Ser 
65 ' 70 .75 BO 

Leu Trp. Arg Asp Leu lie Asp Leu Met Thr Leu Trp Ly3' Arg Lys Cys 
85 90 95 

Leu Arg Glu Gly > Phe He Thr Lys Asp Trp Asp Gin Pro Val Ala Asp 
-100 105 * ' 110 

Trp Lys He Phe Tyr Phe Leu Arg Ser Leu ' His Arg Asn Leu Leu Arg 
115 120 125 

Asn Pro Cys- Ala Glu Glu Asp Met Phe Ala Trp Gin fie Asp Phe Asn 
130 135. ■ 140 

Gly Gly Asp Arg Trp Lys Val Glu Ser Leu Pro Gly Ala His Gly Thr 
145 150 155 160 

Asp Phe Pro Asp Pro Lys Val Lys Lys Tyr Phe Val Thr Ser Tyr Glu 
165 170 175 

Met Cys. Leu Lys Ser Gin Leu Val Asp Leu Val Ala Glu Gly Tyr. Trp 
180 185 190 

Glu Glu Leu Leu Asp Thr Phe Arg' Pro Asp He Val Val Lys Asp Trp 
195 200 205 

Phe Ala Ala Arg Ala Asp Cys Gly Cys Thr Tyr Gin Leu Lys Val Gin 
210 215 220 

Leu Ala Ser Ala Asp Tyr ?he Val Leu Ala Ser Phe Glu Pro Pro Pro 
225 230 235 240 

Val Thr lie Gin Gin Trp Asn Asn Ala Thr Trp Thr Glu Val Ser Tyr 
245 250 , 255 

Thr Phe Ser Asp Tyr Pro Arg Gly Val !Arg Tyr He Leu Phe Gin His 
260 265 .,270 

Gly Gly Arg Asp Thr Gin Tyr Trp Ala Gly Trp Tyr. Gly Pro Arg Val 
275 280 285 

Thr Asn Ser Ser He Val Val Ser Pro .Lys Met Thr Arg Asn Gin Ala 
290 295 . .. 300 

Ser Ser Glu Ala Gin Pro Gly Gin Lys His Gly Gin Glu Glu Ala Ala 

305 310 , % 315 . ' ,, - 320 

Gin Ser Pro Tyr Axy Ala Val Val Gin He Phe 
325 330 
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<210> 303 > 
<211> 328 

<212> PRT ' • * 

<213> Homo sapiens 

<400> 303 

Arg Gin Arg Ser Trp Asn Pro Gly Thr Asn Cys Tyr His Pro Asn Met 
1 5 \ 10 15 

?ro Asp Ala Phe Leu Thr Cys Glu Thr Vai He Phe Ala Trp Ala He . 
20 25 30 • 

Gly Gly Glu Gly Phe Ser Tyr Pro Pro His Val Gly Leu Ser Leu Gly 
35 40 45 . 

Thr Pro Leu Asp Pro His Tyr Vai Leu Leu Glu Val His . Tyr Asp Asn 
50 55 60 

. Pro Thr Tyr Glu Glu Gly Leu He Asp Asn Ser Gly Leu Arg Leu Phe 
65 '70 75 80 

Tyr Thr Met Asp' lie Arg Lys Tyr Asp Ala Gly Val He Glu Ala Gly' 
35 90 95 

Leu, Trp Val Ser Leu Phe His Thr' lie Pro Pro Gly Met Pro. Glu Phe 
100 105 • 110 

Gin Ser Glu Gly His'Cys Thr- Leu Glu Cys Leu Glu Glu Ala Leu Glu 
■ 115 '• ' • 120 125 

Ala Glu Lys Pro Ser Gly He His Val Phe Ala Val Leu Leu His Ala 

. 130 , , 135 ■ 140 

His Leu Ala Gly Arg Gly 'He Arg Leu Arg His Phe Arg Lys Gly Lys 
145 ,150 155 • 160 

Glu Met Lys Leu Leu Ala Tyr Asp Asp Asp Phe Asp Phe Asn Phe Gin 
165 4 170 175 

Glu Phe Gin Tyr Leu Lys Glu Glu Gin Thr He Leu Pro Gly Asp Asn 
180 185 190 

Leu He Thr Glu .Cys Arg Tyr Asn Thr Lys As? Arg Ala Glu Met Thr 
195 r 200 205 

Trp Gly Gly Leu Ser. Thr Arg Ser Glu Met Cys Leu Ser Tyr Leu Leu 
.210 215 220 

Tyr Tyr Pro Arg He Asn Leu Thr Arg Cys Ala Ser He Pro Asp lie 
225 230 235 . \ 240 

Met Glu Gin Leu Gin Phe He Gly Val Lys Glu lie Tyr Arg Pro Val 
245 '• " 250 " 255 

Thr Thr Trp Pro Phe He He Lys Ser Pro Lys Glri Tyr Lys Asn Leu 
260 ' 255 270 

Ser Phe Met Asp Ala Met Asn.' Lys Phe Lys Trp Thr Lys Lys Glu Gly. 
275 280 285 
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Leu Ser Phe Asn Lys Leu Vai Leu Ser Leu Pro Val Asn Val Arg Cys 

290 295 . 300 

Ser, Lys Thr Asp Asn Ala Glu Trp Ser lie Pro Arg Asn Asp Ser He 

305 310 315 320 

Thr Ser Arc. Tyr Arg Lys* Thr Leu 
325 .. ' • 



<210> 304 . 

<211> 272 ' .' * ' ' ' - ' 

<212> PRT 
'■ <213>,Hom6 sapiens * ' 

<400> 304 , 

Ket Cys Cys Trp Pro Leu Leu Leu Leu Trp Gly Leu Leu Pro Gly Thr 

■ 1 ■ ; , 5 10 ,15 

Ala Ala Gly Gly Ser Gly Arg Thr Tyr Pro His Arg Thr Leu Leu Asp 
• ' 20 25 30 

Ser Glu Gly Lys Tyr Trp Leu Gly Trp Ser Gin Arg Gly Ser Gin lie 
35 40 45 

Ala Phe Arg Leu Gin Val Arg Thr Ala Gly Tyr Val Gly Phe Gly Phe 
50 55 60 

Ser Pro Thr Gly 'Ala Met Ala Ser Ala Asp' He Val Val Gly Gly Val 
65 70 75 80 • 

Ala His Gly Arg Pro Tyr Leu Gin Asp Tyr Phe Thr Asn Ala Asn Arg 
• 85 90 . 95 

Glu Leu Lys Lys Asp Ala Gin. Gin Asp Tyr His Leu Glu Tyr Ala Met 
'• 100 105 ' HO" 

Glu Asn Ser Thr His Thr lie He Glu Phe Thr Arg Glu Leu His Thr 
115 120 • 125 

.Cys Asp He Asn Asp Lys Ser He Thr Asp ser Thr val Arg val lie 
130 135 t 140 

Trp Ala Tyr .His His Glu Asp Ala Gly Glu Ala Gly Pro Lys Tyr His . 
145 150 155 , 160 • 

Asp Ser Asn Arg 'Gly Thr Lys Ser Leu Arg Leu Leu Asn Pro Glu Lys 
•' 165 170 175 

Thr Ser* Val Leu Ser Thr Ala Leu Pro Tyr Phe Asp Leu* Val Asn Gin 
180 185 190 ' 

Asp Val Pro' Tie Pro Asn' Lys 'Asp Thr Thr Tyr Trp Cys Gin Met Phe 
t 195- 200 205 

.Lys lie Pro Val Phe Gin Glu Lys His His Val He Lys Val Glu Pro 
210 215 220 
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Val lie Gin Arg Gly His Glu Ser'Leu Val His. His lie Leu Leu Tyr 
225 .'• 230 235 240 

Gin cys Ser Asn Asn Phe Asn Asp Ser Val Pro Gly lie Arg Ala Arg 
, 245 .' . 250 "•' ' 255 

lie Ala lie Thr Pro Thr Cys. Pro Met His Ser Ser Pro Val, Lys Leu 
260 ; 26,5 270 



<210> 305 . ' 

<211> 207 • . . " . 

<212> PR.T ■ 

<213> Homo sapiens 

<400> 305 " 

Thr Gly Thr Phe Trp Ser Pro Arg Spt Gin Arg Arg Gly Cys Cys Gly 

1 ' 5 10 ' 15 

Arg Arg Ala 'Pro Arg Pro Glu Ala Met Glu Asn Gly Ala Vai Tyr Ser 
20 25 30 

Pro Thr Thr Glu Glu. Asp Pro Gly Pro Ala Arg Gly Pro Arg Ser Gly 
35 40 45 

. Leu Ala Ala Tyr ?he Phe Met Gly Arg Leu Pro Leu Leu Arg Arg Val ■ 
SO . '• 55 ' '60 

Leu Lys Gly Leu Gin Leu Leu Leu Ser Leu Leu Ala Phe lie Cys Glu 
; 65 70. 75 •. 80 

Glu Val Val Ser Gin Cys Thr Leu Cys Gly Gly Leu Tyr Phe Phe Gin 
85 90 95 

Phe Val Ser Cys ser Ala .. Phe Leu Leu Ser Leu Leu lie Leu lie Val 
100 ■ 105 no 

Tyr Cys Thr Pro Phe Tyr Glu Arg val Asp Thr Thr Lys Val Lys Ser 
115 ■ 120 * 125;* 

Ser .Asp Phe Tyr lie Thr Leu Gly Thr Gly Cys Val Phe Leu Leu Ala 
L30 135 1 % 1 140 

Ser He lie Phe Val Ser Thr His, Asp Arg Thr Ser Ala Giii He Ala 
145 ■ lbU • 155 ■ ■•• 160 

Ala He Val Phe Gly Phe He Ala Ser Phe Met Phe Leu Leu Asp Phe 
• 165 ' 170 , : -175 

He Thr Met Leu Tyr Glu Lys Arg Gin GIjl Ser Gin Leu Arg Lys Pro 
. , 180 185 '190 

Glu Asn Thr Thr Arg Ala Glu Ala Leu Thr Glu Pro Leu Asn, Ala 
19S _ 200 205 



WO 00/06698 



204 



PCT/US99/17130 



<210> 306 

<2U> 135 . . 

<212> PRT '■ . 

<213> Homo sapiens 

<400> 306 

Ala Ser Ala Pro Arg Val-Met Arc Gly His Leu Ala Gly Phe Pro Ala 
1 5 10 15 

Leu Ser Gly Leu Ala Ser Val Cys Leu Trp Ala Thr Phe Ser Ala Gin 

20 .25. ' ■ . 30 

Leu Pro Gly Pro Val Ala Ala Thr Ser Trp Thr Pro Ala Pro Leu "Gly 
35 40 -45 ■ ' ' 

Cys Ser Ala Ala Arg Ser Gly Pro Glu Lys Arg Lgu Gly Thr Ala Ala 
50 55 60 

Pro Gly Ser Ala Ala Ser Leu Ala Gin Ala Gly Pro Gly Ala Pro Cys' 
65 70 75 80 

Arg Val Leu Pro Val Asp Pro Ala Pro Ala Ala Leu Asn Val Axg Glu 
85 ^ 90 95 

Pro Gly Trp Leu Gly Gly Leu Phe Asp Gly Ala Leu Leu Gin Val Leu 
100 105 110 

Leu Asn , Phe Leu Arg Lys Ser Thr Asp Val Leu Met Asp Thr Arc Glu 
- \ . 115 120 125 

Ala Glu Ser Leu Glu Val Glu 
130 > 135 ' > 



<210> 307 ' ■ . 

<211> 188 
<212> PRT 

<213> Homo sapiens 
<40O> 307 

Asn Lys Leu His Ser Phe Pro Val Phe Leu Ser Gin Leu Leu Leu Asp 

1 ' . . '• 5 10 15 

Arg Gin Leu Leu His Ala Pro Gin Thr Leu Pro Thr Pro' His Cys Gly 
20 ; ' 25 30 

Gly Ser Ser Arg Pro Gly Pro Ser His Pro Pro Trp Leu Leu lie Gin 
35 40 45 

Leu Pro Cys Val His Veil Ala Leu Trp Gin Met Leu Arg Asp Phe. Ser 
50 55 • 60 

Asp Ser Arg lie Thr Pro Ser Thr Leu Thr Thr Gin Pro Ala Ala Gin 

65 , 70 75 ''BO 

Thr Ala. Ala. Pro Ala Lys Asp Gin Glu Ser Asp lie Val Gly Gly Glu 
85 90 95 



Gly He Leu Cys Asp lie Ala Phe Leu Gin. Glu Asp His Pro Leu Gly 
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100 / 105 110 

Val Gly Gly Ala Ser Ala Pro Server Arg Arg Glu Leu Ser-Arg Arg - 
115 ■ _ ' , '12D t 125 

Gly Val Kis Thr Gin Thr Leu Pro Glu Aep Gly Thr Leu His Gly Thr 
' 130 135 - 140 

Pro Ser Ser Ser Phe Asp Cys Gly ile'Lys Tyr lie He Ser Trp Pro 
145 =150 155 160 ' 

Leu Ala Pro Gly Cys Asp 'Leu Pro Ser Leu Glu Leu Ser Leu Val Cys 
165 ' 170' . 175 

Lys Gly Vai Ser Ser Cys Met Gly Phe Ala Ala' Gly 
180 ' * 185 



<210> 308 

<211> 78 

<212> PRT , 

<213> Homo sapiens ' . » 

■<400> 308 

Pro Gly Arg Pro- Thr Arg Pro. Thr Lys Asn Lys Val Cys Val Cys Leu 
1 5 10 * 15 

Gly Mec Leu Phe Trp Ala Tyr Pro lie Cys Val Phe lie Asp Ser Leu 
20 25 30 

Ser Cys Gin Pro Cys Leu Trp Ser Thr Gly Ala=Thr Ser His Phe Asn 
35 ' . 40 • . ' 45 

Ser Pro Thr Thr Ser Pro" Leu Phe Thr Leu Phe Met Pro Cys Ala Leu 
50 55 * 60 

Ala Pro Asn Pro Phe Thr Gin Leu Gly Lys Leu Asp Asp Arg 
65 70 ,75 



<210> 309 . • • 

<21i> 10 t 
<212> PRT 

<213> Homo sapiens ■ . 

<4oo> 309 ; ' ' 

Pro Val Asp , Leu Thr Lys Thr Arg Leu Gin 
1 5 10 



<210>'310 

*211> 10 

<212> PRT ' 

<213> Homo sapiens 

<400> '310 

Pro- Thr Asp Val Leu Lys lie Arg Met Gin 
1 5 10 
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<210> 311 

<211> 313 • < 

<212> PRT . . 

<213> Homo sapiens , . 

<220> 

<22I> SITE 

<222> (117) / 

<223> Xaa equals any of the naturally occurring I. -amino acids 
<400> 311 " ' ' ■ 

Met Thr Phe Cly Ser Thr Xle Ser Pro Thr Ser Thr His, Ala Ser Pro 
1 ' . 5 10 15 

Ser Leu Gly Phe Cys Cys Ser Trp Leu Leu Glu Asp v Leu Glu Glu Gin 

, . 20 '" . 25 30 

Leu Tyr Cys Ser Ala Phe Glu Glu "Ala Ala Leu Thr Arg Arg lie Cys 
35 40 45 

Asn Pro Thr Ser Cys Trp Leu Pro Leu Asp. Met, Glu Leu Leu His Arg 
50. ...... 55 • t 60 

Gin Val Leu Ala Leu Gin Thr Gin Arg val Leu Leu Gly Met Trp Leu 
65 • 70 75 80 

Arg Arg Ala Trp Asp Thr Trp Val Ser Pro Arg Arg Val Ala Pro Gly 
85 , -90 95 

Ser Arg Cys Leu Leu Thr Ala Ser Hi3 Pro Cys Thr Glu Lys Arg Arg 
100 105 " 110 

. Lys Ala Ser Ala Xaa Gin Arg Asn Leu Gly Tyr Pro Leu Ala Met Leu 
115, 120 125' 

Cys Leu Leu Val Leu Thr- Gly Leu Ser Val Leu lie val Ala lie His 
130 t ' ■ • '135 . .140 

lie Leu Glu Leu Leu lie Asp Glu' Ala Ala Met Pro Arg Gly Met Gin 
145 . ' 150 " 155' . 160 

Gly Thr Ser Leu Gly Gin Val Ser Phe Ser Lys Leu Gly Ser Phe Gly 
165 170 175 

, Ala Val lie Cln Val Val Leu He Phe Tyr Leu Met Val Ser Ser Val 
180 185 190 

Val Gly Phe Tyr 1 Ser Ser Pro Leu Phe Arg Ser Leu Arg Pro Arg Trp 
.195 200 . .205 

,His Asp Thr Ala Met Thr Gin He lie Gly Asn Cys Val Cys Leu Leu 
210 215 220 

Val Leu Ser Ser Ala Leu Pro Val Phe Ser Arg Thr Leu Gly Leu Thr . 
22S _ 230 - * 235 ' 240 

Arg Phe Asp. Leu Leu Gly Asp Phe Gly Arg. Phe Asn Trp Leu Gly Asn 
245 250 255 
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Phe> Tyr lie Val Phe Leu Tyr Asn Ala Ala Phe Ala Gly Leu Thr 'Thr ^ 
260 - ; 5 265 , * .270 

Leu Cys Leu Val Lys Thr Phe Thr Ala Ala Val Arg' Ala Glu Leu lie 
275 .' 2B0 285 

Arg Aid Phe Gly Leu Asp Arg Leu Pro Leu Pro Val Ser Gly Phe Pro 
290 295 ,300 

Gin Ala Ser Arg Lys Thr Gin His Gin 
305 310 



<210> 312 

<211> 92 

<212> PRT . 

<213> Homo sapiens . 

<400> 312 

Leu Cys Val Cys Leu "Val .Tyr Leu Cys Mot Tyr Gly Val Cys Leu Cys 
1 5 10 15 

Val He Val Cys Val Ser Gly Val Ser Leu Cys Leu Tyr Val Trp Gly 
20 25 30 

. Val <Scr Val Cys Asp Cys Val Ser Val Phe Met Cys Val Cys Leu Cys 
.35 40 45 

Val lie Phe Cys Val Tyr Gly Lys Pro Arg Thr Glu His Tyr His Ser 
50 55 60 

Pro His Leu Ala Lys Gin Lys Ala Phe Arg Glu Met Cys Gly Arg His 
65 70 75 - '80 

Asp Val Ser Ala Ala Gly He Phe Gin Ser Tyr Val 
85 90 



<21C> 313 v 
<211> 207 
<212> PRT 

<213> Homo sapiens 
' <400> 313 

Gly His Met Pro Tyr Gly Trp Leu Thr Glu He Arg Ala Val Tyr Pro 
1 5 .10 .15 

Ala Phe Asp Lys Asn Asn Pro Ser Asn Lys Leu Val Ser Thr Ser Asn 

2G - ; . 25 30 

■ Thr Val Thr Ala. Ala His lie Lys Lys Phe Thr Phe Val Cys Met Ala 
35 . 40 .45 

Leu Ser Leu 1 Thr Leu Cys Phe Val Met Phe' Trp Thr Pro Asn Val Ser 
50 55 GO 



Glu Lys He Leu He, Asp He He Gly Val Asp Phe Ala Phe Ala Glu 
65 ' 70 .75 80 
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Leu Cys Val Val Pro Leu Arg He Phe Ser ?he Phe Pro Val Pro Val' 
85 "' « 30 95 

Thr Val Arg Ala His Leu Thr Gly Trp Leu Met Thr Leu Lys Lys Thr 
100 105 . 110 

Phe Val Leu Ala Pro Ser Ser Val Leu Arg lie He val Leu He Ala 
115 , 120 125 

■Ser Leu Val Val Leu Pro Tyr Leu Gly Val His Gly Ala Thr - Leu Gly 
1-30 135* 140 

Val Gly Ser: Leu Leu Ala Gly Phe. Val "dy Glu Ser Thr Met Val Ala 
14S 150 155 160 . 

lie Ala Ala Cys Tyr Val Tyr Arg Lys Gin Lys Lys Lys Met Glu Asn 
165 ' 170 1 175 

Glu Ser Ala Thr Glu Gly Glu Asp fipr Ala Mpt Thr Asp Met Pro Pro 
180 185 190 

Thr Glu Glu Val Thr Asp He Val Glu Met Arg Glu Glu Asn Glu 
195 200 205 



<210> 314 t -. , 

<211> 1L4 ' , 

<212> PRT 

<213> Homo sapiens 

<400>' 314 . ' " ' 

Gin Val Val Phe val Ala He Leu Leu His Ser His Leu Glu Cys Arg 
1 5 • 10 15 

Glu Pro Leu Leu He Pro He Leu Ser Leu Tyr Met Gly Ala Leu Val 

"■■•*-■ 20 25 30 

Arg Cys Thr Thr Leu Cys leu Gly Tyr Tyr Lys Asn He His Asp He 
35 . .40 45 . * 

He Pro Asp Arg Ser Gly '■ Pro Glu Leu Gly Gly Asp Ala Thr He Arg 
50 55 * .60 

Lys Met Leu Ser Phe Trp Trp Pro Leu Ala Leu Tie Leu Ala Thr Gin 
65 ■ 70 75 ■ 80 

Arg He Ser Arg' Pro He Val Asn Leu Phe Val Ser Arg Asp Leu Gly 
85 90 95 

Gly ser Ser Ala Ala Thr Glu Ala Val Ala He Leu Thr Ala Thr Tyr 
100 105 • .110 

Pro Val . ' 



<210> 315. 
<211> 115 
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<212> PRT 

<213> Homo sapiens 

<400> 315 

Arg Cys Cys Cys Arg Gly Cys Ser Cys Arg Ala Arg Leu Cys Pro' Pro 
1 ' 5 • 10 ,0.5 

Ala Arg Ser Thr Ala Val Ala Pro Glu Cys Arg Gly Ala His Pro Ser 
20 25 30 

Arg Ala Met Arg Pro Gly Thr Ala Leu Gin Ala Val Leu Leu' Ala Val- 
35 " 4G 45 

Leu Leu Val Gly Leu Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Gin 

50 * 55' ,60 

Pro Val Cys Arg Gly Gly Thr Gin Arg Pro Cys Tyr Lys Val He Tyr 
65. 70 75 80 

?he His Asp Thr Ser Arg Arg T.mi Asn Php Glu Gin Ala T.ys Glu Ala 
85 90 95 

Cys Arg Arg Gly Trp Arg Pro Ala Ser Gin His Arg Vai" Leu Lys Met • 
100 105 110 

Asa Arg Asn 

115 ■ ■ 



<210> 316 

<211> 81 . * 

<212> PRT 

<213> Homo sapiens 

<400> 316 . • , 

Met Arg' Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val Leu Leu;' 
1 5 10 15 

Val Gly Leu Arg Ala 'Ala Thr Gly Arg Leu Leu Ser Gly Gin Pro Val 
20 25 30 

Cys Arg Gly Gly Thr Gin Arg Pro' Cys Tyr, Lys Val He Tyr Phe His 
35 40 " ' 45'' 

Asp Thr Ser Arg Arg Leu Asn Phe Glu G"_u Ala Lys Glu Ala Cys Arg 
50 55 60 ' • 

Arg Gly Trp Arg Pro Ala Ser., Gin Kis Arg Val Leu Lys * Met Asn Arg 

65 • 70 75 . . 80 



<210> 317 
<211> 290 
<212> PRT 
<213> Homo sapiens 
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<400> 317 . , , , . " ' ' 

lie Arg His Glu Cln Gin. Gly Glu Glu Asx> Asp Glu His Ala Arg Pro 

1 5 " '' -1 ; G ' . 15 

Leu Ala Glu Ser Leu Leu Leu Ala. He Ala- Asp Leu Leu Phe Cys Pro 

20 • :i{ 25 • .. 30 •■' 

Asp Phe Thr Val- Gin ser His Arg Arg Ser Thr Val Asp Ser' Ala Glu 

35 ' '' '■ 40 . • ' •■ 45 

Asp Val His- Ser Leu Asp Ser Cys Glu.Tyr He Trp Glu Ala Gly Val 

' 50 ' / . 55- ' . 60 • 

Gly- phe Ala His Ser Pro Gin Pro Ash Tyr lie His Asp Met Asn Arg 

' 65 , V . ; .70 : -75 • 80 

Met - Glu Leu Leu Lys Leu Leu Leu* Thr Cys Phe Ser Glu Ala Met Tyr 

• • . 85 ; . 90 - .' • < • 95 

Leu Pro Pro. Ala " Pro Glu .Ser Gly Ser Thr Asn' Pro Trp Val Gin Phe 

100 ' „ v :-- 105 " 110, • ' 

Phe Cys Ser Thr Glu Asn Arg His Ala Leu Pro Leu Phe thr Ser Leu 

115 . .. . ' 120 4 " 125. 

Leu Asn Thr Val Cys Ala Tyr .Asp Pro Val Gly Tyr Gly lie Pro Tyr 
130 135 " . . < 140 

Asn His Leu Leu Phe Ser Asp Tyr Arg Glu Pro Leu Val Glu Glu .Ala 
145 , ' ISO . ^ 155 160 

Ala Gin Val Leu He Val', Thr Leu Asp His Asp \ Ser Ala Ser Ser Ala 
165 ... 170 " 175'- 

Ser Pro Thr Val . Asp Gly Thr Thr Thr ' Gly Thr Ala Met Asp Asp, Ala . 
; 180 • \i. ■' 185 ' - 190 

Asp Pro Pro Gly Pro Glu Asn Leu. Phe Val Asr. Tyr Leu Ser Arg lie 

195 200 . - , 205 ■ . 

'His Arg Glu Glu Asp Phe Gin Phe lie Leii Lys Gly lie Ala Arg Leu 

210 .. • ' 215 ' — - .. ' N " " 220 •• 

Leu Ser- Ash Pro' Leu Leu Gin Thr • Tyr Leu Pro Asn Ser Thr Lys Lys 
225 * . ' 230 235 ' ' 240 - 

Asp Pro Val Pro Pro Gly Ala Ala Ser Ser Leu Leu Glu Ala Leu Arg 

245 • • . •■ -250u ' 255 

Leu Gin Gin Glu lie Pro Leu Leu Arg Ala Glu Glu Gin Arg Arg Pro 

' 260 .. ! ' , 265 ■ ':. '. / 270 

Arg His Pro Cys Pro His Pro Leu Leu Pro Gin Arg , Cys Pro Glv Arg 
275 280 ' 2 85 ' ' 

Ser val ' * '' 

290 . • 
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<21D> 318 

<211> 313. ■;.*■' ' 

<212> PRT '*'..-' 

<213> Homo sapiens 

<400> 318 

Arg Leu Val Tyr Asn Lys Thr Ser Arg Ala, Thr Gin Phe Pro Asp Gly 

1 5 t 10 - . * - ib 

Val Asp Val Arc? Val Pro Gly Phe Gly Lys Thr Phe Ser . Leu Giu ' Phft 
20 25 30 

Leu Asp 1 Pro Ser Lys Ser Ser Val Gly Ser Tyr Phe His Thr, Met Val . 
3 5 / ' 40 - 45 

Glu Ser Leu Val Gly Trp Gly Tyr Thr Arg Gly Glu Asp ,Val Arg Gly 
50 , , 55 : 60., 

Ala Pro Tyr Asp Trp Arg Arg Ala Pro Asn Glu Asn Gly Pro Tyr Phe 
65' 70 75 80 

Leu Ala Leu Arg Glu Met lie Glu Glu KeL Tyr Gin Leu Tyr Gly Gly 
85 90 95 

Pro Val Val Leu Val Ala His Ser Met Gly Asn Met Tyr Thr Leu Tyr . 
100 105 110 

Phe Leu Gin Arg Gin Pro Gin Ala Trp Lys Asp Lys Tyr ■ He Arg Ala 
115 ' » 120 . . ( 125 . 

Phe Val Ser Leu Gly Ala Pro Trp Gly Gly Val Ala Lys Thr Lou Arg 

130 ( . 135 140 ; " 

Val, Leu Ala Ser Gly Asp Ash Asn Arg lie Pro Val lie Gly Pro Leu 
145' \\ ' 150 . 155 ISO 

Lys He Arg 'Glu Gin Gin Arg Ser Ala Val , Ser Thr Ser Trp Leu Leu 
1^5 . . / 170 175 

Pro Tyr Asn Tyr Thr Trp Ser Pro Giu Lys Val Phe Val Gin Thr* Pro 
.190 * 165 190 

Thr lie Asn Tyr Thr Leu Arg Asp Tyr Arg Lys Phe Phe Gin Asp He 
,195 200 ' ' 205 - 

Gly Phe Glu Asp Gly Trp Leu Met Arg Gin Asp Thr Glu Gly Leu Vol 
210 * . . 215 220 

Glu Ala Thr Met Pro Pro Gly Val Gln^Leu His Cys Leu Tyr Gly Thr 
225, 230 ' 235 • 240 ' 

Gly Val Pro Thr Pro Asp Ser Phe Tyr Tyr Glu Ser Phe Pro Asp Arg 
,245 . . 250 " . ' . /, '255 

Asp Pro Lys He Cys Phe Gly Asp Giy Asp Gly Thr Val Asn Leu Lys 
2C0 " 265 , ■ 270 

Ser Ala Leu Gin Cys Glh Ala Trp Gin Ser Arg Gin Glu His Gin Val 
275 2R0 285 



WO 00/06698 



PCT/US99/17130 



212 



Leu, Leu Gin Glu Leu^'Pro Gly Ser Glu His lie Glu Met Leu Ala Asn 

290 ; . * ' 295 * 300 

Ala Thx Thr Leu Ala "Tyr Leu Lys Arg Val Leu Leu Gly Pro 
305 310 315 



<210> 319 ' ' r , ■ 

<211> 362 . ' *.-' 

<212> PRT ' * . v . •' 

. <213> Homo sapiens , '■■ 1 

<400> 319. - 

Met Asti Lys Glu Asp Lys Val Trp Asn Asp Cys Lys Gly Val Asn Lys 
1 . . ' 5 10 15 

Leu Thr Asn Leu Glu Glu Gin Tyr lie rle Leu lie Phe Gin Asn Gly 

• 20 ' 25 ■ ; • • 30 

Leu Asp Pro Pro Ala Asn Met Val Phe Glu Ser lie He Asn Glu lie 
35 40 45 

Gly lie Lys Asn Asn lie Ser. Asn Phe Phe Ala Lys lie Pro Phe Glu 
. 50 55 • , 60 

Glu Ala Asn Gly Arg Leu Val Ala Cys Thr Arg Thr Tyr Glu Glu Ser 
65 70 • • 75 80 

lie Lys Gly Ser Cys Gly Glh Lys Glu Asn Lys lie Lys Thr Val Ser 

'- 85 : 90 95 

Phe Glu Ser Lys Hie Gin Leu Arg Ser Lys Gin Glu Phe Gin Phe Phe 

ioo t los ; 110 

Asp Glu Glu Glu Glu Thr Gly Glu Asn His Thr He Phe He Gly Pro 
115 ' 120 / 125 

Val Glu Lys Leu lie Val Tyr Pro Pro Pro Pro Ala Lys Gly Gly He 
130 135 , 140 

Ser Val Thx Asn Glu Asp Leu His Cys Leu Asn Glu Gly Glu Phe Leu 
' 145 150 / 155 160 

Asn Asp Val Tie He Asp Phe Tyr Leu Lys Tyr Leu Val Leu Glu Lys 
165 170 175 

Leu Lys Lys Glu Asp Ala Asp Arg lie His He Phe Ser Ser Phe Phe 
Y*" 180 , .185 190 

Tyr Lys Arg* Leu Asn Gin Arg Glu Arg 'Arg Asn His Glu Thr Thr Ash 
•'• 195 \ 200 205, 

Leu Ser Ile'Gln Gin Lys Arg His Gly Arg Val Lys Thr Trp Thr Arg 
210 • 215 220 * 

'His Val Asp lie Phe Glu Lys Asp Phe He Phe Val Pro *Leu Asn Glu 
225 230 ^ . 235 . 240 
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Ala Ala His Trp Phc Leu Ala Val Val Cys Phe Pro Gly Leu Glu Lys 
245 250 '255- 

Pro Lys Tyr Glu Pro Asn Pro His Tyr His Glu Asn Ala val He Gin 
■260 265 270 

Lys Cys Ser Thr Val Glu Asp Ser Cys He' Ser Ser Ser Ala Ser Glu 
' 27S 200 285 

Met Glu Ser Cys Ser Gin Asn Ser Ser Ala Lys Pro Val He, Lys Lys 
290 295 ; 300 

Met Leu Asn Lys Lys His Cys He Ala Val He Asp Ser Asn Pro Gly 
305 , ■ 310- ■ 315 320 

Gin Glu Glu Ser Asp Pro Arg Tyr Lys 'Arg Asn He Cys Ser Val Lys 
325 .330 .335 

Tyr Ser val Lys Lys He Asn His Thr Ala Ser Glu Asn Glu Glu Phe 
340 345 350 

Asn Lys Gly Glu Ser Thr Scr Gin Lys 1 Ser 

360 





355 


<210> 


320 


<211> 


330 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SITE 


<222> 


(38) 


<223> 


Xaa equals any of 


<220> 




<221> 


SITE 


<222> 


(247) 


<223> 


Xaa equals any of 


<400> 


320 


Met Ser Pro Leu Ser Ala 


1 


5 



10 15 

Gly Ala Ala Val He Leu Ala GLn Leu Lou Arg Arg Cys Arg Gly GLy 
20 25 30 

Phe Leu Glu Pro Val Xaa Pro Pro Arg Pro Asp Arg Val Ala He Vai 
35 40 45 

Thr Gly Gly Thr ; Asp Gly He Gly Tyr Ser Thr Ala Asn He Trp Arg 
50 55 • 60 

Asp Leu Gly Met His Val He lie Ala Gly Asn Asn Asp Ser Lys' Ala 
65 70 75 80 

Lys Gin Val Val Scr Lys He Lys Glu Glu Thr Leu Asn Asp Lys Val 

■85 90 v ' 95 
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Glu Phe Leu Tyr Cys Asp Leu Ala Ser Mer Thr Ser He Axg Gin Phe 
100 105 , HO 

Val Gin Lys Phe Lys Met Lys Lys He Pro Leu His Val Leu He Asr. 
115 120 125 

Asn Ala Gly Val Met Met: Val Pro Gin Arg Lys Thr Arg Asp Gly Phe 
. 130 135 140 

Glu Glu His Phe Gly Leu Asn Tyr Leu Gly His Phe Leu Leu Thr Asn 
. 1'45 - .150 155' 160 

Leu Leu Leu Asp Thr Leu Lys Glu Ser Gly Ser Pro Gly. His Ser Ala 
.:' 155 • 170 . 175 1 

Arg Val Val Thr Val Ser Ser Ala Thr His Tyr' Val Ala Glu , Leu Asn 
180 , 185 190' 

Met Asp Asp Leu Gin Ser Ser Ala Cys Tyr Ser Pro His Ala Ala Tyr 

195 >*• 200 ' 205 

Ala . Gin Ser Lys Leu Ala Leu Val Leu Phe Thr Tyr His . Leu Gin Arg 
210 215 220 

Leu Leu Ala Ala Glu Gly ( Ser His Val Thr. Ala Asn Val Val Asp Pro 
225 2 3 0 235 240 

Gly Val Val Asn Thr Asp' Xaa Tyr Lys His Val . Phe Trp Ala Thr Arg 
245 250 255' 

Leu Ala Lys Lys Leu Leu Gly Trp - Leu Leu Phe Lys Thr Pro Asp Glu 
260 265 270 

Gly Ala Trp Thr ser .He Tyr Ala Ala Val' Thr Pro Glu Leu Glu Gly 
275 / ' 280 285 

Val Gly Gly Arg Tyr Leu Tyr Asn Glu lys Glu Thr Lys Ser Leu His 
220 • 295 . 300 

Val Thr Tyr. Asn Gin Lys Leu Gin Gin Gin Leu . Trp Ser Lys Ser Cys, 
305 ' "310 315 320 

Glu Met Thr Gly Val Leu Asp Val Thr Leu 
325 330 



<210> 321 ■ - ■ > 

<211> 71 • . " . , s 

<2i2> PRT ti ' 

<213> Homo sapiens 

<220> ' • ' 

<221> SITE • ■ 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-amir.o acids 
<400> 3 21 

Met Ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr Ala Val 
1 , . 5 .10 • 15 
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Gly Ala Ala Val He Leu Ala Gin Leu Lea Arg Arg Cys Arg Gly 'Gly 

20 25 30 

Phe Leu GLu Pro Val Xaa Pro Pro. Arg Pro Asp Arg Val Ala He Val 

. 35 . ' . . 40 45 

Thr Gly Gly Thr Asp Gly He Gly Tyr Ser Thr Ala Asn He Trp Arg 

50 55 60 

Asp Leu Ala Cys Met Leu Ser . 

65 70 •". , 



<210> 322 * 
<21I> 266 
<212> PRT 

<213> Homo sapiens 

<220> - ' 

<221> SITE 

<222> (97) * 

<223> Xaa equals any of ' the naturally occurring L-amino acids 
<220> 

<221> SITE . * , 

<222> (174) 

<223> Xaa equals any of the naturally occurring L-amirib, acids 
<220> 

<221> SITE . '* ■ ■ * 

<222> (195) 

<22 3> Xaa equals any o£ the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> (199) - 
<223> Xaa equals any o£ the naturally occurring L-amino acids 

<220> , 

<221> SITE 

<222> (206) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 322 

Met Glu Val Thr Thr Glu Asp Thr Ser Arg Thr Asp Val Ser Clu Tro 

1 , " • '■■ 5 .10 15 

Ala Thr ser Gly Gly Ala Ala Asp Gly Val Thr Sex lie Ala Pro Thr ' 
20 25 30 

Ala Val Ala Ser Ser Thr , Thr Ala Ala Ser Il« Thr Thr Ala Ala Ser 
35 40 45 

Ser Met -Thr Val Ala Ser Ser Ala Pro ,Thr Thr Ala Ala Ser Ser Thr 

50 55 ■ . " 60 ; 

Thr Val Ala Ser lie Ala Pro Thr Thr Thr Ala Ser Ser Met Thr Ala 
65 70 .75 80 
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Ala J Ser Ser Thr Pro Met Thr Leu Ma Leu Pro Ala Pro Thr' Ser Thr 
85 90 95 

Xaa Thr Gly Arg Thr Pro Ser Thr Thr Ala Thr Gly Kis Pro Ser Leu 
. 100 105 110 

Ser ,Thr Alii Leu Ala Gin Vai Pro Lys Ser ser Ala Leu Pro' Arg Thr 
115 120 ' 125 

Ala Thr Leu Ala Thr Leu Ala Thr Arg Ala Gin Thr Val Ala Thr Thr 

' 130 * 135 140 . t . , 

Ala Asn Thr Ser Ser Pro Met Ser Thr , Arg Pro Ser Pro Ser Lys His 
145 . .150 ' 155 160 

Met Pro Ser Asp Thr Ala Ala ser Pro Val Pro Pro Met Xaa Pro Gin 

165 . 170 \ , 175 

Ala Gin Gly Pro lie Ser Gin Val Ser Val Asp Gin Pro Val Va\ Asn 
.180 185 . 190 

Thx Thr Xaa Lys Ser Thr xaa Met Pro ser Asn Thr Thr Xaa Glu Pro 
195 200 205 

Leu Thr Gin Ala. Val Val Asp Lys Thr Leu Leu Leu Val Val Leu Leu 
210 215 : - 220 , 

Leu Gly Val Thr Leu Phe He Thr Val Leu Val Leu Phe Ala Leu Gin 
225 - 230 235 240 

Ala Tyx Glu Ser Tyr Lys Lys Lys Asp Tyr Thr Gin Val Asp Tyr Leu 

245 r : 250 1 ( 255 

He Asn Gly Met Tyr Ala Asp Ser Glu Met- ' 
260 ' , 265 



<210> 323 
. <211> 99 
<212> PRT 

<213> Homo sapiens 
<400> ,323 

Ala Arg Cys Pro Glu Leu Pro Gly Leu Arg Cys Arg Pro Arg Pro Arg 
1 . 5 . 10 15 

. ' . i 

Ala Gly Pro Gin Ala Pro Ser Tyr Cys Pro Arg Ala Thr Arg Pro Pro 
.20 25 ' . ' 30 

Gly Ala Cys Cys Ala Arg Ket Arg Leu Leu Leu Glu ' Trp' Arg Val Tyr 
35 40 45 

Leu Arg. Leu Thr Cys Ala Thr t Lys Asp Gly Met Ala Arg Glu Cys Pro 
50 ' 55* SO 



Thr Thr Trp Leu Ser Pro Pro Ala lys Pro Asp Phe Ala Gin Arg His 
65 70 75 80 
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Ser Val *Lys Pro Thr Ala Leu Gin Gly Gly' Arg Trp Ser Arg Leu Gly 
85 ' .90 • 95 



.Ala Ser ; Pro 



<21p> 324 - t . . 

<211> 96 

<212> prt : ',; 

<213> Hono sapiens 

<400> 324 

Leu Pro Ala Thr Val Glu Phe Ala Val , His Thr Phe Asn Gin 1 Gin Ser 
1 • 5 • ' • 10 • 15 

Lys Asp Tyr Tyr Ala Tyr Arg Leu Gly. His lie Leu Asn Ser Trp Lys 
20 25 ' .30 . 

Glu Gin Val Glu Ser L,ys Thr Val Phe Ser Met Glu Leu Leu Leu Gly 
' 3 5 40', 45 

Arg Thr Arg Cys Gly Lys Phe Glu Asp Asp He Asp Asr. Cys .His Phe 
50 ■ ; 55 ' 1 ■ . 60 

Gin Glu Ser Thr Glu Leu ,Asn Asn Thr Phe Thr Cys Phe Phe Thr He 
S5 70 75 80 

Ser Thr Arg Pro Trp Met Thr Gin Phe Ser Leii Leu Asn Lys Thr Cys. 
' 65 90 ' 95 



■ <210> 325 
<211> 166 . 
<212> PRT' 

.<213> Homo sapiens ' 
<400> 3 25 s 

Leu Leu Trp Ala Arg Gly Leu Gly Arg Ala Lys Ser Ala Val Pro Thr 
1 5 . 10 15 

Val Ser Thr Ket Leu Gly Leu Pro Trp Lys Gly Gly Leu Ser Trp Ala 
20 25 . 30 

Leu Leu Leu' Leu Leu Leu Gly Ser Gin. ile Leu Leu He Tyr Ala Trp 
35 40 45 

His Phe His Glu Gin Arg Asp Cys Asp Glu His Asn Vol Ket Ala Arg 
50 .55" 60 

Tyr Leu Pro Ala Thr Val Glu Phe Ala. Val His Thr Phe Asn Gin Gin 

65 ' .70 ■ • 75. • 80 



Ser 



Lys Asp Tyr Tyr Ala Tyr Arg Leu Gly His lie Leu Asn Ser Trp 
85, 90 -95 
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Lys Glu Gin Val Glu Ser Lys Thr. Val Phe* Ser Met Clu Leu Leu Leu 
' ? 1 . ' .100 105, . 110 

Gly Arg Thr Arg Cys Gly Lys Phe Glu Asp Asp He Asp Asn Cys His 
115 120 ' 125 

?he Gin Glu Ser Thr Glu Leu Asn Asn Thr Phe Thr Cys Phe Phe Thr 
130 135 . 140 ; 

lie Ser Thr Arg Pro Trp Met Thr Gin Phe Ser Leu Leu Asn Lys Thr 
145 - 150' 155 ' ' 160 

Cys Leu Glu Gly Phe His '', . - 

, 165 



<210> 326 

<211> 214 . ' 

• <212> PRT v ■ 1 

<213> Homo sapiens 

<220> • 

<221> SITE . , 

<222> (200) • ' '-' 

<223> Xaa equals any of the naturally occurring L-amino acids 

' <220> ' . ' • 

<221> SITE 
<222> (205) 

<223> Xaa equals any of the naturally, occurring' L-amino acids 
<400> 326 

Leu Glu Gin. Lys Leu Glu Leu His Arg Gly Gly Gly Arg Ser Arg Thr 
1 5 10 , 15 

Ser Gly Ser Pro Gly Leu Cln Glu Phe Gly Thr Arg Glu Glu Arg Gly ' 
20 25 • 30 

Glu Gly Glu Gin Axg Thr Gly Arg Glu Phe Ser Gly Ash Gly Gly, Arg 
. 35 . . 40 . : 45 ' ; . 

Ala Val Glu Ala Ala Arg Met Arg Leu Leu Cys Gly Leu Trp Leu Trp * 
50 55 60 

Leu Ser Leu Leu Lys Val Leu Gin Ala. Gin Thr Pro Thr Pro t.eu Pro 
' 65 70 ' 75 80 

Leu Pro Pro Pro Kqt: Gin Ser Phe' Gin Gly Asn Gin Phe Gin Gly Glu 
85 . \ 90* 95 

■ Trp Phe' Val Leu Gly Leu Ala Gly Asn Ser, ?hc Arg Pro Glu His Arg 

' . 100 • ' . . • ' 105 , .110 

Ala Leu Leu Asn Ala ; Phe Thr Ala Thr Phe Glu Leu Ser Asp Asp Gly 
115 120 125 

Arg Phe Glu Val Trp Asn Ala Met Thr Arg Gly Gin His Cys Asp Thr 
130 • 135 140 



WO 00/06698 



219 



PCT/US99/17130 



Trp Ser Tyr Val Leu lie Pro Ala 
145 - ■ 150 

Asp His Gly Val Gly Arg Ser Trp 
165 

Gin Phe lie Cys Leu Gly Arg Ala 
' : 180 

val Phe Pro Asp Val Thr Gly Xaa 

195 200 

Trp Val Ala Ala Pro Ala 
210 



Ala Gin Pro Gly Gin Phe Thr Val 

'155 , 160 

Leu Leu Pro Pro Gly Thr Leu Asp 
•170 175 

Gin Gly Leu Ser Asp Asp /Asn lie 
185 190 '; 

Ala Leu Aso Leu Xaa Ser Leu Pro 
205 



<210> 32*7 

<211> 181 

<212> PRT' 1 

<213> Homo , sapiens . ., 

<400> .327 

Met Cys Val Cys.glu Arg Lys Airg Gly Arg Glu Lys Glu Gly Gly Val 
1 5 10 15 

Thr Pro Thr Met Thr Ser Asn Phe Pro Phe Cys Thr Leu lie Leu Gly 
20 . "25 ' 30 

lie Ala Gin Ala Gin Ala Cys Pro Gly Cys Pro Gly Asp Trp Pro Gly 
35 40 45 

Leu Gly Ser Gly Val Gly Glu Gly Leu His His lie Arg Thr Cys Arg 
50 55 . 60 

Thr Pro He Pro Cys Ser Pro Pro Ala Pro Ala Ala Ala Cys Leu Gly 
65 • ' 70 75 80 

Ser Cly His Ala Arg Leu Pro Cys Val Leu Arg Leu Trp Pro Val Pro 
85 90 95 

Ala Asn Leu Ser Ser Pro Phe Arg Leu Glu Ala Leu His Cys Ser Phe 
100 105 110 

Trp Ser Ser Pro Leu Leu Pro Ala Pro His Leu Ala Phe Phe Gly Phe 

" 115 *'- ' * 120 125 

Arg Asp Leu Leu Thr Asp Phe Leu Leu Ala Ala Cys Leu Leu Thr Phe 
130 , 135 140 

Gin Lys Thr Pro Leu. Glu Leu Pro Met Ala Val Val His Leii Leu Val 
145 150 ' "155 160 

Ala Thr -Pro Cys Tyr G_n Met Leu Asp Asn Leu Pro Leu Pro Ser Ala 
! 165 170 • ' 175 



Ala Ala Asn Trp Cys, 
180 
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f <210> 328 • .v ' 

<2.U> 195 1 
<212> PRT / • 

<213> Homo sapiens 

<400> 328 " . " 

Tyr Leu Trp Gly Arg Pro Arg Leu Arg Met Arg Ala Gly Thr ' Ser Pro 

1 , - . ' 10 ' 15 

Ser Ala Pro Trp Gly Glu Lys Arg Glu Lys Leu Gly His Lys Leu Pro 
20 "25 • .30 

Val Ala Leu Gin Gly Tyr His Pro Trp lie Leu Leu Glu Cys Thr Val 
35 40 45 

Phe Trp Ala Arg Val Val r.eu Ala Cys Phe Ser Leu Tyr Leu lie Arg 
50 ' ' ' 55 60 

Gly, Pro Asn, Cys lie' Ash Arg Gin Pro Glu Pro Thr Tyr Gin Lvs Ala 
65 70 75 60 

' Cys Asn Leu Asp Cys Ser Ser, Asp ,?he Gly Gin Glu Arg Ala Pro Ala 
85 90 95 

'* Trp Glu Leu Leu Gly Pro Glu Ser Glu Gin Arg Leu Arg Glu -Tyr Thr 
100 105 110 

Ala Gin Gly Leu Gin Ser Leu Ala Ser Ser His Arg trp Arg Gin Phe 
115 * 120 125 

Lys Thr Glu Gly Lys Met Arg Gly Gly Ala Ser Pro Leu Pro Trp Leu 
130 135 140 

lie Cys Phe Trp Leu Cys Ser Tyr Lys Gly Ser Asp Asn Ser Leu Lys 
'1^5 150 155 « 160 

Pro Val Val Pro Gly Pro Thr Leu Cys Pro Gin Ser Leu Val Ser Pro 
165 .170 . 175 

Ser Val* His Pro Ser' Thr Arg Ser Ala Ser Leu Gly Arg His Arg Ala 
180 .'. '185 • 190 ' 

Glu Ala Ala 

'195. ., ■ " ••" : 



<210> 329 ' ■ 

<211> 50 ; * • 

■ <212> PRT. / _ , 

<213> Homo sapiens 

<400> 329 ■ ■ : f ■ 

Met Pro' Gly lie Leu Ala Gly He Pro Val Lys Asp Leu Cys Leu Ser 
1 . •. 5 .. 10 15 

Leu Leu Gin Gly Phe Arg Leu Leu Leu Leu Cys Val Cya Pro Gly Trp 

20 ' 25 30 



Leu Ser Gly Trp Met Gly Gly Gin Lys Gly. Ser Pro Arg lie Val Asp 
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35 40 45 



He Gly 
50 



<210> 330 ' • ■ 

<211> 90 ' 
<212> FRT ■'. 
<213> Homo sapiens • 

<400> 330 

Ala Lys Gly Glu Glu Artj Lys Glu Ala Phe Ser : Leu Lys Met Val Gin 
1 ' • 5 ' 10 . •. 15 

Leu Ser Ser Glu , Pro -Tie Ser Phe Gly Leu Met Tyr^Leu Tyr Lau H Gly. 
• * 20, • 25 30 

Val Phe Phe His Leu lie Tyr Pro Gly Ala Leu Ser Tie Thr Thr Leu 
35 . 40 45 

Gly' Lys His Ser His Pro Phe Phe Thr Ala Glu Gin Asn Sex Thr Val 
50 55 60 , 

Trp Met Glu. His Thr Leu Phe His Gin Ser Pro Val Ala Ser His Leu 
' 65 ' , 70 75 ■ 80 

Val Cys Phe Gin Ser Phe Ala Phe Ser Glu 
85 .90 



<210> 331 • , , 

<211> 56 - ' \ ' ' , 

<212> PRT 

<213> Homo sapiens 

<40O>*331 

Gly Pro Ala His Pro Aid. Ser Pro Pro Leu Met Thr Leu Ser Leu Gin 
1 ,5 10 15 

Leu Ala Glu Leu Val His Phe Val Cys Ala Phe Gin Ser Gin Trp Thr 

, 20 ' 25 ' ' 30 

Gly Val Tyr Pro Met Met Pro Pro leu Lys Pro Thr Glu Pro Leu 'Cys 
35. 40 45 

Phe Ala Cys Val Pro Cys Arg Val 

50 55 ' ' 



<210> 332 - 

<211> .13 ' 

<212> PRT . 

<213> Homo sapiens 

<400> 332 • 
Met Leu Leu Glu Val Tyr Gly Asp Ser He Ser Val Thr Val Ala He 

1 5 ' ' ' '• 10 15 
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222 



Pro Leu 



<210> 333 
<211> 19 

■ <212> PRT t ' . 
<213> Homo 'sapiens ' 

<400> 333 ' 

Met His Ser Pro Cys Gin Ser Lys Ala Ala Asp Gly Leu' Gly Lys Ser 

1 5 10 , .15 

< Glu Thr Glu " . . 



<210> 334 

<211> 10 , ' 

<212> PRT . 
<213> Homo sapiens 

<400> 334 

Met Leu Lys Ser Leu Gly Leu Ser Thr Asn 

1 . .5 ,10 

<210> 335 

<211> 200 ' ( 
<212> PRT 

<213> Homo sapiens ' ' ' 

<400> 335 " \ ' 

Ala Gin Arg Leu Ala Glu Glu Cys Phe Tyr Met Leu Leu Glu Val Tyr 

1 • • •' ' 5 ( ' 10 15 

Gly Asp Ser lie Ser Val Thr Val Ala' He Pro Leu Met His Ser Fro 

20 , 25 30 ■ 

Cys Gin Ser Lys Ala Ala Asp Gly Leu Giy Lys Ser Glu Thr Glu Met " 
• 35 , 40 . • 45 

Leu Lys Ser Leu Gly Leu Ser Thr Asn Met Ser fro Phe His Leu Leu 
50 55 ' 60 

Gly I.eu Lys Val Phe Leu Thr Trp Ala Leu Thr Leu Ala Gin lie Cys 
65 70 75 - 80 

Leu Tyr' Phe' Phe Glu Val Gin Pro Leu Gly Leu Leu Ala Leu Asn Phe 
05 90 .95 

Phe Cys Thr Ala- Thr A^a Gly Leu Lys Glu Leu Cys Met His Pro Pro 
100 , 105 110 

Ser Leu Ala" Phe Thx Pro Glu Phe His Thr Ser Leu Ser Pro Leu Ala 
115 120 ,. 125 

. lie Pro Ser Phe Cys Gly Thr Ser Val Ser, Leu Ser Asn Ser His Thr 
130 135 140 
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223 



j-le Pro Leu Ser Leu Tyr Liu Pro Phe Pre Ser Lys Ser Arg Met Pro 
145 150 155 160 

Asp Thr Leu - His Leu Leu Val His Ser : Leu Pro Leu Val His Ser Gin 
165 •• 170 175 

Val Lou ; Pro Val Lys Asp Val Thr He Glu Trp Pro Leu Cys Gin Arg 

... . 130 185 190 

* Cys Leu Gly Ser ' Thr Cys His Gin 
. 195 200 



<210> 336 . 

<211> 99 . 

<212> PRT ' ' 

<213> Homo sapiens 

* <220> ' ' " . 

' <221> ' SITE 
<222> {,94) .' * 

<223> Xaa equals any of the naturally occurring L-tunirio acids 

<220> . 

<221> SITE, 

' <222> (99) * 

<223> Xaa equals any 'of the naturally occurring L-amino acids 

<40O> 336 . ' ' 

Trp He Pro Arg Ala Ala Gly He Arg His Glu Val -Gin Val Ser Leu 
1 5 ■ 10 15 

Phe Gin Met Phe cys Phe Ser ser He Phe cys Ser His Glu His Thr 
.20 25 30 

His Leu Pro Gly Thr Phe Trp Leu Phe Leu Phe Leu Phe Leu lie Leu 
35 1 40 45 

Pro Pro Ser Cys Pro Cys Phe Leu Pro Phe Ser Leu Ala He Glu Thr 
■50 55 60 

Val Arg Trp Pro' Cys Trp His His Pro Thr Ser Phe Glu Leu Cys Tyr 
65 ,70 - 7b 80 

Pro Gly Thr Ser He Tyr Tyr Ala "Ser Arg Gly Gly Pro Xaa' Pro Asn 
.85 90 95 

Ser Glu Xaa 



<210> 337 t 

<211> 96 " 

<212> PRT 

<213> Homo. sapiens 

<220> 

<221> KITE ' ' ' 
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. . • .224" . . 

<222> (1) v 

<223> Xaa equals any of the naturally occurring L-ainino acids' ' 

^220> . t 

<221> SITE 

<222> (3) ' ■ ' . , , 

<223> Xaa equals any of the naturally occurring L -amino acids 



<400> 337 . 

Xaa Asn Xaa Lys Ser Pro Leu Thr lie Gly Asn Lys Ser Trp Ser Ser 

. 1 ' 5 . . 10 . 15 

Thr Ala, Val Ala Ala Ala Leu Glu Leu'^Val Asp Pro Pro Gly Cys Arg V 

' ' ' , 20 25 '.' ', • '30 

■ * • ** 

Asn Ser Ala Arg Asp Ser Pro Glu Leu Val His Leu Gly Lys Gly Arg 
.•35 40 45 

Pro Arg Lys Leu. Met Thr Tyr Leu Phe Cys Ser Ser lie Ser Leu Leu 
53 .55 60 

Leu Leu Lys Val His Ser Ser Gly His Gin Asp He' Arg Lys Ala Lys 

65 r 70 75*'. 30 

Ser Lys Val Pro Arg Leu Leu He lie Gin Cys Pro Gin Gin Arg Glu 
B5 . '90 95 



<210> 338 '. . ; . 

<211> 54 , . 

<212> PRT 

<213> Homo sapiens 
<400> 338 

Gly Pro Glu Glu Asn Leu Sex Pro Ser Thr Pro Ser Gin Met Pro Thr 
1 '5 10 15 

He Trp Val Lys Leu Cys -Leu Leu Gin Val' Cys His' Gly Leu Phe Pro 
20 • 25 30 

Leu Leu Lys His Trp Ser Gin Pro Met Pro Leu Cys Val Thr Leu Ala 
35 40 45 

Pro Val Ser Tyr Trp Leu 
50 

<210> 339 • T ". • • : 

<2U> 287 
<212> PRT 

<213> Hoir.o sapiens 

<400> 339 • ' 

Pro Arg Val Arg Lys Glu Pro Glu Ala Met Gin Trp Leu Arg Val Arg 
- 5 10 15. 
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Clu Ser Pro Gly Glu Ala Thr Gly His Argr Val Thr Met Gly Thr Ala 
20 ' 25 30 

Ala Leu Gly Pro Val Trp Ala Ala Leu Leu Leu Phe Leu Leu Met Cys 
35 . \ t 40 45 

Glu lie Pro Met Val Glu Leu Thr Phe Asp Arg Ala Val Ala Ser Asp 
50 55 SO 

Cys Gin Arg Cys Cys Asp Ser Glu Asp Pro Leu Asp Pro Ala His Val 
65 70 75 ' 80 

Ser Eer Ala Ser Ser Ser Gly Arg Pro His Ala Leu Pro Glu lie Arg 
85 . 90 . 95 

Pro Tyr Tie Asn lie Trur lie Leu Lys Gly Asp Lys Gly Asp Pro Gly 
100' 105 110 

?ro Met Gly Leu Pro Gly Tyr Met Gly Arg Glu Gly Pro Gin Gly Glu 
115 120 125 

Pro Gly Pro Gin Gly Ser Lys Gly Asp Lys Gly Glu Met Gly Ser Pro 
. "130 135 '• 140 

. Gly Ala Pro Cys Gin Lys Arg Phe Phe Ala Phe Ser Val Gly Arg Lys . 
145 ISO 155 160 

Thr Ala Leu His Ser Gly Glu « Asp Phe Gin Thr Leu Leu Phe Glu Arg 
165 170 , 175 

. Val Pne.Val Asn Leu Asp Gly Cys Phe Asp .Met Ala Thr Gly Gin Phe 
180 '' 185 190 

Ala Ala Pro Leu Arg Gly lie Tyr Phe Phe Ser Leu Asn Val His Ser 
195 200 205 

Trp Asn Tyr Lys ,Glu Thr- Tyr Val .His* I-e Met His Asn Gin Lys Glu 
210 • : 215 ' 220 • 

Ala Val lie Leu Tyr Ala ,Gln" Pro Ser Glu Arg Ser He Met Gin Ser 
'225 , 23 0 235 .240 

Gin Ser Veil Met Leu Asp Leu Ala Tyr Gly Asp Arg Val Trp Val Arg 
245 ' 250 255 

Leu Php Lys Arg Gin Arg Glu Asn Ala He Tyr Ser Asn Asp Phe Asp 
260 ^ 265 270 

Thr Tyr He Thr Phe Ser Gly His Leu He Lys Ala Glu Asp Asp ' 
.275 2R0 ' 285 



<210> 340 

<211> 33 9 

<212> PRT 

. <213> Homo sapicnb 

<400> 340 - ' 

Met Leu Tyr Pro Gly Ser Val Tyr Leu Leu Gin Lys Ala Leu Met Pro 
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1 5. . 1 10 15 

Val Leu Leu Gin Gly Gin Ala Arg Leu Val Glu Glu Cys Asn Glv Arg 
20 25 30 

Arg Ala r.ys T.pj Leu Ala -Cys Asp Cly Asn Glu lie Asp Thr Met Phe 
3 5 .40' 45 

■ t 

Val Asp Arg Arg Gly Thr Ala Glu Pro Gin Gly Gin Lvs Leu Val He 
50 . ; '55 ' • 60 

Cys Cys , Glu Gly Asn Ala Gly Phe Tyr Glu Val Gly 1 ' Cys Val Ser Thr 
65 70 75 - 80 

Pro Leu Glu Ala Gly. Tyr Ser Val Leu Gly Trp Asn His Pro Gly ?he 
85 90 ; , 95 

Ala Gly Ser Thr Gly Val Pro Phe Pro Gin Asn Glu 'Ala Asn Ala Met 
100 105 110 

Asp Val Val Val Gin. Phe Ala 'lie His Arg Leu Gly Phe Gin 'Pro Gin 
115 120 ' ; 125 _ h ■• 

Asp He He 'lie Tyr Ala Trp Ser He Gly Gly Phe Thr Ala Thr Trp 
130 135 140 

Ala Ala Met Ser Tyr Pro Asp Val Ser Ala Met lie \ Leu Asp Ala Ser 
145 150 155 . 160 

Phe Asp Asp Leu' Val Pro Leu Ala Leu Lys. Val Met Pro Asp Ser Trp 
165 . v 170 175 

Arg Gly Leu Val Thr A~g Thr Val Arg Gin His Leu Asn Leu Asn Asn 
180 18-5 ' 190 

Ala Glu Gin Leu Cys Arg Tyr Gin Gly Pro Val Leu Leu' He Axg Arg. 
' 195 ' 200 ■ * 205 

Thr Lys Asp Glu He He Thr Thr Thr Val Pro Glu Asb He Met Ser ' 
210 215 220 

Asn Arg Gly Asn Asp Leu Leu Leu Lys Leu Leu Gin His Arg Tyr Pro 
225 230 . .235 240 

Arg Val Met Ala Glu Glu Gly Leu Arg Val Val Arg Gin Trp Leu Glu 
245 . 250 255 

Ala Ser Ser Gin Leu Glu Glu Ala Ser He Tyr Ser Arg Trp Glu Val 
260 , - 265 270 

Glu Glu Asp Trp Cys Leu Ser Val Leu Arg Ser Tyr Gin Ala Glu Kis 
275 280 ' 285 

Gly Pro Asp Phe Pro Trp Ser Val Gly Glu Asp Met Ser Ala Aso Gly 
290 295 , -300 . ; 

Arg Arg Gin Leu Ala Leu Phe Leu' Ala Arg Lys His Leu His Asn Phe 
305 310 315 320 
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Glu Ala Thr His Cys Thr Pro Leu Pro Ala Gin Asn Phe Gin Met Pro 

: 325 ' 330 335 / 

Trp His Leu 



<210> 341 .. ' • 

<211> 127 . 
<212> PRT 

<213> Homo sapiens 1 

<400> 341 „, ■ . • 

Val Cys Pro Lys Trp Cys Arg Phe Leu. Thr Met Leu Gly His Cys Cys ' 
1 5 10 15 

Tyr Phe Trp Gin Val Trp Pro Ala' Scr Glu Ala Leu Ala Ala Gly Pro 
20 25 30 

Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser Trp Lys Gin His lie Gly 
35. . 40 45 , 

Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu Pro Thr Thr Thr Leu Thr 

50 55 .,. . 60 

Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly Lys Asn Pro Ala Ala Gly 
65 70; 75 80 

: Arg Ser Leu Glu Gly. Ala Leu Pro Ala Gly Val Trp Pro Cys Phe Ala 
85 90 95 

Gln.Ser Pro Cys Thr Gly Gly Gin Gin Thr Pro Ser Ser Thr Gly Leu 
100 105 110 

Arg Ser Cys Leu Val Arg Ser Pro Ala Thr Trp Trp Arg Thr Pro • 

... US 120 125 



<210> 342 

<211> 554. ♦ " ' 

<212> PRT 

<213> Homo sapiens 

<220>- 1 
<221> SITE 
<222> (16) 

<223> Xaa equals any of the naturally occurring L-araino acids 
<220> 

<221> SITE '■*.■'■ ''..*■ 

<222> (109) 

<223> Xaa equals any of the naturally occurring L-amino acids' 
<400> 342 . 

Trp He Pro Arg Ala Ala .Gly He Arg His Glu He Tyr Arg Glu Xaa 
1 5 10 * 15* 

Asp Ser Glu Arg Ala Pro Ala Ser * Val Pro Glu Thr Pro Thr Ala Val 
20 25 30 
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Thr Ala Pro His Ser Ser Ser .Trp Asp Thr Tyr. Tyr Gin Pro Arg Ala 
35 ,40 45 

Leu Glu Lys His Ala Asa Ser He Leu Ala Leu Ala Ser Val Phe Trp 
50 55 60 

Ser He Ser Tyr Tyr ser Ser Pro Phe Ala Phe Phe Tyr Leu Tyr Arg 
65 70 75 80 , 

Lys Gly Tyr Leu Ser Leu Ser Lys Val Val Pro Phe Ser His Tyr Ala 

85' - ^ 90 95 

Gly. Thr Leu 'Leu Leu Leu Leu Ala Gly. Val Ala Cys Xaa Arg Gly He 
100 105 . 110 

Gly Arg Trp Thr Asn Pro Gin Tyr Arg Gin Phe ' lie Thr He Leu Glu ' 
115 120 125 

Ala Thr His Arg Asn Gin Ser Ser Glu Asn Lys Arg Gin Leu Ala Asn 
130 135 140 

Tyr Asn Phe Asp Phe Arg Ser Trp Pro Val Asp Phe His Trp Glu Glu 
145 150 155 160 

Pro Ser Ser Arg Lys Glu Ser Arg Gly Gly Pro Ser Arg Arg Gly Val 
165. 170 * 175 

Ala Leu Leu Arg Pro Glu Pro Leu His Arg Gly Thr' Ala Asp Thr Leu 
1B0 185 190 - 

Leu Asn Arg Val Lys Lys Leu Pro Cys Gin He Thr Ser Tyr Leu Val 
195 200 205 

Ala His Thr Leu Gly Arg Arg Met Leu Tyr Pro Gly Ser Val Tyr Leu 
210 215 220 

Leu Gin Lys Ala Leu Met Pro Val Leu Leu Gin Gly Gin Ala Arg Leu 

225' . 230 J 235 .- 240 

Val Glu Glu Cys Asn Gly Arg Arg Ala Lys Leu Leu Ala Cys Asp Gly 
245. " 250 255 

Asn Glu He Asp Thr Met Phe Val Asp Arg Arg Gly Thr Ala Glu Pro 
260 265 270 

Gin Gly Gin Lys Leu Val He Cys Cys Glu Gly Asn Ala Gly Phe Tyr 
275 ,280 235 

Glu Val Gly Cys Val Ser Thr Pro Leu Glu Ala Gly Tyr Ser Val Leu ' 
290 295 300 

Gly Trp Asn His Pro Gly Phe Ala Gly ser Thr Gly Val Pro Phe Pro 
305 310 315 320 

Gin Asn Giu Ala Asn Ala Met Asp Val Val Val Gin Phe Ala He His 
325 330 335 

Arg Leu Gly Phe Gin Pro Gin Asp lie He lie Tyr Ala Trp Ser He 
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340 345 350 

Giy Gly Phe Thr Ala Thr Trp Ala Ala .Men Ser Tyr Pro Asp Val ser 

355 ■ . ' ' 360 365 

Ala Met lie Leu Asp Ala Ser Phe Asp Asp Leu Val Pro Leu Ala r,ou 
370 375 390 

Lys Val Met Pro- Asp Ser Trp Arg Gly Leu val Thr Arg Thr val Arg 
385 390 395 400 

Gin His Leu Asn Leu Asn As.i Ala Glu Gin Leu Cys Arg Tyr Gin Gly 
405 410 415 

Pro Val Leu Leu lie Arg Arg Thr Lys Asp' Glu He lie Thr Thr Thr 
420 425 430 

Val Pro Glu Asp lie Met Ser Asn Arg Gly Asn Asp Leu Leu Leu Lys 

435 440 445 \ 

Leu Leu Gin His Arg Tyr Pro Arg Val T Met Ala Glu Glu Gly Leu Arg 
450 455 460 

Val Val Arg Gin Trp Leu Glu Ala Ser Ser Gin Leu Glu Glu Ala Ser 
465 470 475 4B0 

He Tyr Ser . Arg Trp Glu Val Clu Clu A3p Trp Cys Leu Ser Val Leu 
485 490 ' - 495 

Arg Ser Tyr Gin Ala Glu His Gly Pro Asp Phe Pro Trp Ser Val Gly 
500 505 510 

Glu Asp Met Ser Ala Asp Gly Arg Arg Gla Leu Ala Leu Phe Leu Ala 
515 520 525 

Arg Lys His Leu His Asn Phe Glu Ala Thr His Cys Thr Pro Leu Pro 
530 535 ' 540 



Ala Gin Asn ?he Gin Met Pro Trp His Leu 
545 550 



<210> 343. 
<211> 225 
<212> PRT 

<213> Horao sapiens 
<220> 

<221> SITE 
<222> (5) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 343 

His Glu Arg Ala Xaa Gly Pro Ser Arg Gly His Gly Glu Leu Leu Ser 
1 5 10 15 



Cys Val Leu Gly Pro Arg Leu Tyr Lys He Tyr Arg Glu Arg Asp Ser 
20 25 30 
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Glu Arg Ala Pro Ala Ser Val Pro Glu Thr Pro Thr Ala Val Thr Ala 
35 40 45 

Pro His Ser Ser Sec Trp Asp Thr Tyr Tyr Gin Pro Arg Ala Leu Glu 
50 55 60 

Lys His Ala Asp Ser lie Leu Ala Leu Ala Ser Val Phe Trp Ser He 
€5 70 ' ■ 75 80 

Ser Tyr Tyr Ser Ser Pro Phe Ala Phe Phe Tyr Leu Tyr Arg Lys Gly 
85 90 95 

Tyr Leu Ser Leu Ser Lys Val Val Pro Phe Ser His Tyr Ala Gly fhr 
100 105. HO 

Leu Leu Leu Leu Leu Ala Gly Val Ala Cys Ser Glu Ala Leu Ala Ala 
115 1 120 125 

Gly Pro Thr Pro ser Thr Gly Ser Ser Ser Pro Ser Trp Lys Gin His 
130 135 140 

Tie Gly Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu Pro Thr Thr Thr 
145 150 155 160 

Leu Thr Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly Lys Asn Pro Ala 
165 170 175 

Ala Cly Arg Ser Leu Glu Gly Ala Leu Pro Ala Gly Val Trp Pro Cys 
180 185 190 

Phe Ala Gin Ser Pro Cys Thr Gl.y Gly Gin Gin Thr Pro Ser Ser Thr 
195 200 ' 205 

Gly Leu Arg Ser Cys Leu Val Arg Ser Pro Ala Thr Trp Trp Arg "hr 
210 215 • . 220 

Pro 
225 ' 



<210> 3 44 

<211> 299 

<212> PRT 

<213> Homo sapiens 

<400> 344 

Met Phe Lys Arg His Gin Arg Leu Lys Lys Asp Ser Thr Gin Ala Glu 
1 5 10 15 

Glu Asp Leu Ser Glu Gin Glu Gin Asn Gin Leu Asn Val Leu Lys Lys 
20 25 30 

His Gly Tyr Val Val Gly Arg Val Gly Arg Thr Phe Leu Tyr Ser Glu 
35 40 45 

Glu Gin Lys Asp Asn He Pro Phe Glu Phe Aop Ala Asp Ser L*u Ala 
50 55 60 

?he Asp Met Glu Asn Asp Pro Val Met Gly thr His Lys Ser Thr Lys 
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55 70 75 8 o 

Gin Val Glu Leu Thr Ala Gin Asp Val Lys Asp Ala his Trp Phe Tyr 
85 90 95 

Asp Thr Pro Gly lie Thr Lys Glu Asn Cys lie Leu Asn r.eni Leu Thr 
100 105 110 

Glu Lys Glu Val Asn He Val Leu Pro Thr Gin Ser lie Val Pro Arg 
115 120 125 

Thr Phe Val Leu Lys Pro Gly Met Val Leu Phc Leu Gly. Ala He Gly 
130 135 140 

Arg He Asp Phe Leu Gin Gly Asn Gin Ser Ala Trp Phe Thr Val Val 
145 150 155 . 160 

Ala Ser Asn He Leu Pro Val His He Thr Ser Leu Asp Arg Ala Asp 
165 170 175 

Ala Leu Tyr Gin. Lys His Ala Gly His Thr Leu Leu Gin He Pro 'Met • 
180 185 " . 190 

Gly Gly Lys Glu Arg Met Ala Gly Phe Pro Pro Leu Val Ala Glu Asp 
195 200 205 

lis Met Leu Lys Glu Gly Leu Gly Ala Ser Glu Ala Val Ala Asp He 
210 215 220 

Lys Phe Ser Ser Ala. Gly Trp Val Ser Val Thr Pro Asn Phe Lys Asp 
225 230 235 240 

Arg Leu His Leu Arg Gly Tyr Thr Pro Glu Gly Thr Val Leu Thr Val 
245 250 ' * 255 

Arg Pro Pro Leu Leu Pro Tyr He -Val Asn He Lys Gly Gin Arg He 
26G 26S 270 

Lys Lys Ser Val Ala Tyr Lys Thr Lys Lys Pro Pro Ser Leu Met Tyr 
275 280 1 285 

Asn Val Arg Lys Lys Lys Gly Lys He Aan Val 
290 295 



<210> 345 
<211> 314 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (147) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> < 211) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<dO0> 345 

Met Leu Pro Ala Arg Leu Pro Pee Arg Leu Leu Ser Leu Phe Leu Arg 

1.-5 .10 15 

Gly Sex Ala Pro Thr Ala Ala Arg His Gly Leu Arg Glu Pro Leu Leu 
20 25 30 

Glu Arg Arg Cys Ala Ala Ala Ser Ser Phe Gin His Ser Ser Ser Leu ' 
35 40 45 

Gly Arg Glu Leu Pro Tyr Asp Pro Val Asp Thr Glu Gly Phe Gly Glu 
50 55 60 

Gly Gly Asp Met Gin Glu Arg Phe Leu Phe Pro Glu Tyr He Leu Asp 
65 70 75 80 

Pro Glu Pro Gin Pro Thr Arg Glu Lys Gin Leu Gin' Glu Leu Gin Gin 
85 90 95 

Gin Gin Glu Glu Glu Glu Arg Gin Arg Gin Gin Arg Arg Glu Glu Arg 
100 105 110 

Arg Gin Gin Asn Leu Arg Ala Arg Ser Arg Glu His Pro Val Val Gly 
115 120 125 

His Pro A3p Pro Ala Leu Pro Pro Ser, Gly Val Asn Cys Ser Gly Cys 
130 135. 140 

Gly Ala Xaa Leu His Gys Gin Asp Ala Gly Val Pro Gly Tyr Leu Pro 
145 . 150 155 160 

Arq Glu Lys Phe Leu Arg Thr Ala Glu Ala Asp Gly Gly Leu Ala Arg 
165 170 * 175 

Thr Val Cys Gin Arg Cys Trp Leu Leu Ser His His- Arg Arg Ala Leu 
180 185 190 

Arg Leu Gin Val Ser Arg Glu Gin Tyr Leu Glu Leu Val Ser Ala Ala 
195 200 205 

Leu Arg Xaa Pro Gly Pro Ser Leu Val Leu Tyr Met Val Asp Leu Leu 
210 ■ 215 220 

Asp Leu Pro Asp Ala Leu Leu Pro Asp Leu Pro Ala Leu Val Gly" Pro 
225 230 235 240 

Lys Gin Leu lie Val Leu Gly Asn Lys Val Asp Leu Leu Pro Gin Asp 
245. 250 . 255 

Ala Pro Gly Tyr Arg Gin Arg Leu Arg Glu Arg Leu Trp Glu Asp Cys 
260 265 270 

Ala Arg Ala Gly Leu Leu Leu Ala Pro Gly Thr Lys Gly His Ser Ala 
275 280 / *235 

Pro Ser Arg Thr Ser His Arg Thr Gly Arg He Arg He Arg Arg Thr 
290 295 300 
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Gly Pro Ala Gin Trp Ser Gly Thr Cys Gly 
305 310 



<21Q> 346 

<211> 380 * - 

<212> PRT 

<213> Hoir.o sapiens 
<400> 346 

Fro Ser Phe Arg Arg Glu Arg Val Glu Thr Gly Gly Gly Gly Pro Val 
1 5 10 15 

Thr Kis Gly Thr Glu Gly Pro Phe Leu. Pro Leu Pro Gly Gly Thr Arg 
20 25 30 

Met Asn Met Thr Gin Ala Arg Val Leu Val Ala Ala Val Val Gly Leu 
35 40 45 

Val Ala Val Leu Leu Tyr Ala Ser lie His Lys He Glu Glu Gly His 
50 . 55 60 

Leu Ala Val Tyr Tyr Arg Gly Gly Ala Leu Leu Thr Ser Pro Ser Gly 
55 70 75 80 

Pro Gly Tyr His lie Met Leu Pro Phe He .Thr Thr Phe Arg Ser Val 
85 90 95 

Gin Thr Thr Leu Gin Thr Asp Glu Val Lys Asn Val Pro Cys Gly Thr 
100 105 110 

Ser Gly Gly Val Met He Tyr He Asp Arg He Glu Val Val Asn Met 
115 120 125 

Leu Ala Pro Tyr Ala Val Phe Asp lie Val Arg Asn Tyr Thr Ala Asp 
130 135 ' 140 

Tyr Asp Lys Thr Leu He Phe Asn ^ys He His His Glu Leu Asn Gin 
"5 150 155 160 

Phe Cys Ser Ala His Thr Leu Gin Glu Val Tyr' He Glu Leu Phe Asp 
165 170 175 

Gin He Asp Glu Asn Leu Lys Gin Ala Leu Gin Lys Asp Leu Asn Leu 
180 185 190 

Met Ala . Pro Gly Leu Thr He Gin Ala Val Arg Val Thr Lys Pro Lys 
195 200 20b 

He Pro Glu Ala lie Arg Arg Asn Phe Glu Leu MRt Glu Ala Glu Lys 
210 215 220 

Thr Lys Leu Leu lie Ala Ala Gin Lys Gin Lys Val Val Glu Lys Glu 
225 230 235' 240 

Ala Glu Thr Glu Arg Lys Lys Ala Val lift Glu Ala Glu Lys He Ala . 

245 250 255 

Gin Val Ala Lys lie Arg Phe Gin Gin Lys Val Met Glu Lys Glu Thr 
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260 265 270 

GIu Lys Arg lie Ser Glu lie Glu Asp Ala Ala Phe Leu Ala Arg. Glu 
275 280 285 

Lys Ala Lys Ala Asp Ala Glu Tyr Tyr Ala Ala His Lys Tyr Ala Thr 
290 2 £5 ' 300 

Ser Asn Lys His Lys Leu Thr Pro Glu Tyr Leu Glu Leu Lys Lys Tyr 
305 310 . 315 320 

Gin Ala He Ala Ser Asn Ser Lys He Tyr Phe Gly Ser Asn lie Pro 
325. ,330 335 

Asn Met Phe Val Asp Ser Ser Cys Ala Leu Lys Tyr Ser Asp He Arg 
340 345 350 

Thr Gly Arg Glu Ser Ser Leu Pro Ser Lys Glu Ala Leu Glu Pro Ser 
355 360 ■ t 365 ■ 

Gly Glu Asn Val He Gin Asn Lys Glu Ser Thr Gly 
370 ■ 375 380 



<210> 347 

<211> 422 • 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (328) 

<223> Xaa ecruals any of the naturally occurring L-amino acids 

<400> 347 

Trp Ser Thr Gly Asn Ala Ser Trp Glu Lys Lys Asp Asn Phe lie Leu 

1.5 10' 15 

Ser Ala Asp Phe Glu Met Met Gly Leu Gly Asn Gly , Arg Arc Ser Met 
20 25 30 

Lys Ser Pro Pro Leu Val Leu Ala Ala Leu Val Ala Cys He He Val 
35 40 45 

Leu Gly Phe Asn Tyr Trp Ile'Ala Ser Ser Arg Ser -Val Asp Leu Gin 
50 55 " * 60 

Thr Arg He Met Glu Leu Glu Gly Arg val Arg Arg Arg Ala Ala Glu 
65 -70 75 , - 80 

Arg Gly Ala Val Glu Leu Lys Lys Ash Glu Phe Cln Gly Glu Leu Glu 
85 90 95 

Lys Gin Arg Glu Gin Leu Asp Lys He Gin Ser Ser His Asn Phe Gin 
100 1.05 110 

Leu Glu Ser Val Asn Lys Leu Tyr Gin Asp Glu Lys Ala Val Leu Val 
115 12C 125 
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Asn Asn lie Thr Thr Gly Glu Arg Leu He Arg Val Leu Gin Asp Gin 
130 135 140 

Leu Lys Thr Leu Gin Arg Asn Tyr Gly Arg Leu Gin Gin Asp Val Leu 
145 150 155 160 

Gin Phe Gin Lys Asn Gin Thr Asn Leu Glu Arg Lys Phe Ser Tyr Asp 
165 1/0 175 

Leu Ser Gin Cys He Asn Gin Met Lys Glu Val Lys Glu Gin Cys Glu 
180 185 190 

Glu Arg He Glu Glu Val Thr Lys Lys Gly Asn Glu Ala Val Ala Ser 
195 200 205 

Arg Asp Leu Ser 3lu Asn Asn Asp Gin Arg Gin Gin Leu Gin Ala Leu 
210 215 220 

Ser Glu Pro Gin Pro Arg Leu Gin Ala Ala Gly Leu Pro His Thr Glu 
225 230 , 235 240 

Val Pro Gin Gly Lys C-ly Asn Val Leu Gly Asn Ser Lys Ser Gin Thr 
245 250 255 

Pro Ala Pro Ser Ser Glu Val Val Leu Asp Ser Lys Arg Gin Val Glu 
260 265 270 

Lys Glu Glu Thr. Asn Glu He Gin Val Val Asn Glu Glu Pro Gin Arg 
275 280 285 

Asp Arg Leu Pro Gin Glu Pro Cly Arg Glu Gin Val Val Glu Asp Arg 
290 295 . 300 

Pro Val Gly Gly Arg Gly Phe Gly Gly' Ala Gly Glu Leu Gly Gin Thr 
305 310 ' 315 320 

Pro Gin Val Gin Ala Ala Leu Xaa Val Ser Gin Glu Asn Pro Glu Met 
325 f 330 335 

Glu Gly Pro Glu Arg Asp Gin Leu Val He Pro Asp Gly Gin Glu Glu 
340 345 350 

Glu Gin Glu Ala Ala Gly Glu Gly Arg Asn Gin Gin Lys Leu Arg Gly 
355 360 365 

Glu Asp Asp Tyr Asn Met Asp Glu Asn Glu Ala Glu Ser Glu Thr Asp 
370 375 380 

Lys Gin Ala Ala Leu Ala Gly Asn Asp Arg Asn He Asp Val Phe Asn 
385 390 395 400 

Val Glu Asp Gin Lys Arg Asp Thr He Asn Leu Leu Asp Gin Arg Glu 
405 410 415 

Lys Arg Asn His Thr Leu 
420 



<210> 348 
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<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 348 

Ser Leu His Arg Phe Val Leu Ser Gin Ala Lys Asp Glu Leu . 
1 ,5 10 

<210> 349 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 349. 

Phe lie Lys Phe Phe Ala Pro Trp Cys Gly His Cyo Lye Ala^ Leu Ala 
1 5 ■ 10 15 

Pro Thr Trp 



<210> 350 

<211> 19 * 
<212> PRT 

<213> Homo sapiens 
<400> 350 

Phe lie Lys Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Thr Leu Ala 
1 5 .10 15 

Pro Thr Trp 



<210> 351 

<211> 363 . ■ ■. ' . 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (42) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 351 

Arg Arg Gly Arg Gly Val Pro Gly Pro Arg Gly Arg Arg Arg Leu Trp 
1 5 ... 10 15 

Ser Ala Ala Cys Gly His Cys Gin Arg Leu Gin Pro, Thr Trp Asn Asp 
20 25 30 • 

Leu Gly Asp* Lys Tyr Asn Ser Met Glu Xaa Ala Lys Val Tyr Val Ala 
35 40 45 

Ly3 Val Asp Cys Thr Ala His Ser Asp Val Cys Ser Ala Gin Gly Val 
50 55 60 

Arg Gly Tyr Pro Thr Leu Lys Leu Phs Lys Pro Gly Gin Glu - Ala Val 
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65 70 75 80 

Lys Tyr Gin Gly Pro Arc Asp Phe Gin Thr Leu Glu Asn Trp Met Leu 
85 " 90 . 95 

Gin Thr r.eu Asn Glu Glu Pro Val Thr Pro Glu Pro Glu Val Glu Pro 
100 ' 105 110 

Pro Ser Ala Pro Glu Leu Lys Gin. Gly Leu Tyr Glu Leu Ser Ala* Ser 
115 120 125 

Asn Phe Glu Leu His Val Ala Gin Gly Asp His Phe lie Lys Phe Phe 
130 135 140 

Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Thr ?rp Glu Gin 
145 ' 150 155 160 

Leu Ala Leu Gly Leu Glu His Ser Glu Thr Val Lys lie Gly Lys Val 
165 170 , 175 

A.sp Cys Thr Gin His Tyr Glu Leu Cys Ser Gly Asn Gin Val Arg Gly 
180 185 190 

Tyr Pro Thr Leu Leu Trp Phe Arg Asp Gly Lys Lys Val Asp Gin Tyr 
195 200 * 205 

Lys Gly Lys Arg Asp Leu Glu Ser Leu Arg Glu Tyr Val Glu Ser Gin 
210 215 ' "220 

Leu Gin Arg Thr Glu Thr Gly Ala Thr Glu Thr Val Thr Pro' Ser Glu 
225 230 235' 240 

Ala Pro Val Leu Ala Ala Glu Pro Glu Ala Asp Lys Gly Thr Val Leu 
245 , 250 255 

Ala Leu Thr Glu Asn Asn Phe Asp Asp Thr He Ala Glu Gly He Thr 
260 265 270 

Phe rle Lys Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Thr Leu Ala 
275 280 285 

Pro Thr Trp Glu Glu Leu Ser Lys Lys Glu Phe Pro Gly Leu Ala Gly 
290 " 295 300 

Val Lys Tie Ala Glu Val Asp Cys Thr Ala Glu Arg Asn Tie Cys Ser 
305 310 315 320 

Lys Tyr ser Val Arg Gly Tyr Pro Thr Leu Leu Leu Phe Arg Gly Gly 
325 330 335 

Lys Lys Val Ser Glu His Ser Gly Gly Arg As? Leu Asp Ser Leu His 
340 "345 350 

Arg Phe Val Leu Ser Gin Ala Lys Asp Glu Leu 
355 360 



<210> 352 
<211> 93 
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<212> PRT 

<213> Homo sapiens 
<400> 352 

Met Arg Pro Gin Gly Pro Ala Ala 
1 5 

Leu Leu Leu Leu Leu Leu Gin Leu 
20 

Tie Pro Lys Gly Lys Gin Lys Ala 

35 40 

Thr Cys He Met Glu Cys Ala Tyr 
50 55 

Glu Thr Gly Ala Leu Gly -Pro Met 
65 70 

Gin Val Gly Met Asp Ser Lys Glu 
35 



Ser Pre Gin Arg Leu Arg Gly Leu * 
10 15 

Pro Ala Pro Ser Ser Aia Ser Glu 
25 30 

His Ser Gly Arg Gly Arg Trp Trp 
45 

Lys Gly Gin Gin Glu Cys Lau Val 
60 

Ala Phe Arg Val His Leu Gly Ser 
75 80 

Lys Arg Gly Asn Val 
90 



<210> 353 

<211> 273 . 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (210) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 353 ' 

Glu Thr Arg Val Lys Thr Ser Leu Gin L.en T,mi Arg Thr Gin Leu Glu 

1 . . 5 10 15 

Pro Thr Gly Thr val Gly Asn Thr He Met Thr Ser Gin Pro Val Fro 
20 "25 30 

Asn Glu Thr He Zle Val Leu Pro Ser Asn Val lie Asn Phe Ser Gin 
35 40 t 45 

Ala Glu Lys Pro Glu Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys 
50 55 60 

His Leu His Ala Glu He Lys Val lie Gly Thr He Gin He Leu Cys 
65 vu 75 80 

Gly Met Met Val Leu Ser Leu Gly He He Leu Ala Ser Aia Ser Phe' 
85 90 ''95 

Ser Pro Asn Phe Thr Glr. Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr 
100 105 110 

Pro Phe He Gly Fro Phe Phe Phe He He Ser Gly Ser Leu Ser He 
115 120 125 

Ala Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His S*r Ser Leu Val 
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120 135 140 

Gly Ser He Leu Ser Ala Leu Ser Ala Leu Val Gly Phe He He Leu 
145 150 155 160 

Ser Val Lys G]n. Ala Thr Leu Asn Pro Ala Ser Leu Gin Cys Glu Leu 
165 170 * 175 

Asp Lys Asn Asn lie Pro Thr Arg Ser Tyr Val Ser Tyr Phe Tyr His 
180 185 190 

Asp Ser Leu Tyr Thr Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala 
195 200 ' 205 

Gly Xaa Leu Ser Leu Met Leu, He Cys Thr Leu Leu Glu Phe Cys Leu 
210 215, , 220 

Ala Val Leu Thr Ala Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe 
225 230 235 240 

Pro Gly Ser Val Leu Phe Leu Pro His Ser Tyr lie Gly Asn Ser Gly 
245 250 255 

Met Ser Ser Lys Met Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr 
260 265 270 



<210> 354 
<211> 192 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SjCTE 
<222> (129) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 354 

Met Met Val Leu Ser Leu Gly lie He Leu Ala Ser Ala Ser Phe Ser 
1 5 10 15 

Pro Asn Phe Thr Gin Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro 
20 25 30 

Phe He Gly Pro Phe Phe Phe He He Ser Gly Ser Leu Ser He Ala 
35 '40 45 

Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly 
50 55 60 

Ser He Leu Ser Ala Leu Ser Ala Leu Val Gly Phe He He Leu Ser 
65 70 75 * 80 



Val Lys Gin Ala Thr. Leu Asn Pro Ala Sex Leu Gin Cys Glu Leu Asp 
85 90 95 
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Lys Asn Asn lie Pro Thr Arg Ser Tyr Val Sar Tyr Phe Tyr His Asp 

100 105 110 

Ser Leu Tyr Thr Thr Asp Cys Tyr Thr . Ala Lys Ala Ser Leu Ala Gly 

115 120 125 

Xaa Leu Ser Leu Met Leu lie Cys Thr Leu Leu Glu Phe Cys Leu Ala 

130 135 140 

Val Leu Thr Ala Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe Pro 

145 150 155 160 

Gly ser Val Leu Phe Leu Pro His Ser Tyr lie Gly Asn Ser Gly Met 

165 . 170 175 

Met Thr Kis.Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser 

180 185 190 



Ser Ser Lys 



<210> 355 
<211> 204 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (119) 

<223> Xaa equals any of the naturally occurring L- amino acids 

<40O> 355 

Gly Ala Ser Cys Glu Gly Gly Gly Ala Ala Ala Arg* Ala Ala Leu Gly 
1. 5 10 15 > 

Val His Arg Ser Gin Lys Ala Leu Leu Val Phe Arg Arg Thr Leu Ser 
20 25 30 

Asn- Leu Leu Tyr Met Pro Leu Leu Arg Gly Leu Leu Trp Leu Gin Val 
35 * 40 45 

Leu Cys Ala Gly Pro Leu His Thr Glu Ala Val Val Leu Leu Val Pro 
50 55 60 

Ser Asp Aop Gly Arg Ala Phe Leu Leu Arg Ser Arg Leu Leu His Pro 
65 ,70 75 80 

Glu Ala His Val Pro Pro Ala Ala Asp Arg Gly Ala , Ser Leu Gin Cys 
85 90 95 

Val Leu His Gin Ala Ala Pro Lys Ser Arg Pro Arg Ser Pro Ala Ala 
100 105 110 

Gly Ala Ala Leu Leu His Xaa Pro Arg Arg Thr Gly Asp Glu Pro Cys 
115 120 125; 

Arg Gli phe His Gly Asn Gly Phe Pro Gly Pro Thr Gin Leu Thr Pro 
130 135 . 140 
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Gly Glu Cys Gly Leu Pro Ale Pro Ser Ser Leu Leu Gin His Ala Ser 

145 150 155 ISO 

Ala Pro Val Arg Thr Gly Ser Glu Gly Gin Val Val Gly Cys Pro Arg 
165 170 175 

Ala Arg Gly Glu Thr " Gly. Glu Gly Leu Ser Leu Ala Phe Leu Ser Ser 
180 185 > 190 

Leu Met Phe Thr Ser Arg Asn Gly Leu Val Gly Cys 
195 200 



<21Q> 356 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<400> 356 

Met Gly Ser Ala Ala Leu Glu lie Leu Gly Leu Val Leu Cys Leu Val 
1 * 5 10 15 

Gly Trp Gly Gly Leu lie Leu Ala Cys Gly Leu Pro Met Trp Gin Val 
20 25 . 30 

Thr Ala Phe Leu Asp His Asn He Val Thr Ala Gin Thr Thr Trp Lys 
35 40 45 

Gly Leu Trp Met Ser Cys Val Val Gin Ser Thr Gly Thr Cys Ser Ala 
50 55 GO 



i.ys Cys Thr Thr Arg Cys Trp Le-2 
65 70 



<210> 357 ...... 

<211> 115 

<212> PRT 

<213> Homo sapiens 

<400> 357 

Leu Lys Arg Ala Pro Pro Gly Pro Ala Leu Ala Lys Gly Leu Leu Gin 
1 5 10 15 

Pro Sor Ser Thr Phe Gin Ala Leu Glu Thr Asn He Gly Asp Gin Vdl 
20 . 25 30 

Arg Arg His Ser Thr Ala Val Val He Arg Glu Met Thr Ser Tyr lie 
35 40 45 

Leu He Ser Phe Val Leu Leu He Gly Val Gly Cys lie Glu Lys Asp 
50 55 60 

Gin Ser Cys Pro Val Phe Gly Gly Arg Lys Arg Leu His Leu Leu Phe 
65 70 75 30 

Val Gly Gly Gin Leu. Arq Gin Val Arg Met Leu Ara Gly Glu Leu Ser 
85 90 95 
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Cys Ala Cys Tyr Arg Pro His Val Gin Ala Leu Gin Leu Gly Gly Cys 
100 105 110 

Thr Cys Phe 
115 



<210> 358 
<211> 88 
<212> PRT 

<2X3> Homo sapiens 
<400> 358 

Val He Lys Leu lie Cys Pro Ala Ala Phe Pro Val Tyr Phe Gin Asp 
15 10 15 

Met Ala Arg Gly Cys Val Cys Ser Leu Cys Ala Ser Val Cys lie Phe 
20 25 30 

Leu Ser Ser Leu Phe Pro Leu Leu Pro Ser Val His Ser Val Asn lie 
35 40 • 45 

He Ser Cys Leu Leu Leu Ser Lys Cys Phe Glu Gly Leu Glu Leu Met 
50 55 60 

Cys Glu His Leu Tyr Gin Leu Ser Gin Leu His Val Leu His His lie 
55 70 75 80 

Phe Ser Tyr Leu Leu Cys Thr Pro 
85 



<210> 359 
<2L1> 716 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (2) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (373) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (705) 

<223> xaa equals any of the naturally occurring L-a.nino acids 
<40O> 359 

Tyr Xaa lie Pro Gly Ser Thr His Ala Ser Gly Arg Gin Arg Gly Ser 
1 5 , 10 lb 



Gly Arg Gly Glu Asp Asp Ser Gly Pro Pro Pro Ser Thr Val He Asn 
20 25 30 
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Gin Asn : Glu Thr Phe Ala Asn lie He Phe Lys Pro Thr Val Val Gin 
35 40 45 

Gin Ala Arg He Ala Gin Asn Gly Tie Leu Gly Asp Phe He He Arg 
50 55 60 

Tyr Asp Val Asn Arg Glu Gin Ser He Gly Asp He Gin Val Leu Asn 
65 70 75 80 

Gly Tyr Phe Val His Tyr Phe Ala Pro Lys Asp Leu Pro Pro *Leu Pro 
B5 90 95 ' 

Lys Asn Val Val Phe Val Leu Asp Ser Ser Ala Ser Met Val Gly Thr 
100 105 - * 110 

Lys Leu Arg Gin Thr Lys A£p Ala Leu Phe Thr He Leu His Asp Leu 
115 120 125 

Arg Pro Gin. Asp Arg Phe Ser He He Gly Phe Scr Acn Arg He Lys 
130 ^ 135 140 

val Trp Lys Asp His Leu He Ser Val Thr Pro Asp Ser He Arg Asp 
145 150 155 160 

Gly Lys Val Tyr He His His Ket Ser Pro Thr Gly Gly .Thr Asp lie 
165 170 175 

Asn Gly Val Leu Gin Arg Ala He Arg. Leu Leu Asn Lys Tyr Val Ala 
180 185 190 

His Ser Gly He Gly Asp Arg Ser Val Ser Leu He Val Phe Leu Thr 
195 200 205' 

Asp Gly Lys Pro Thr Val Gly Glu Thr His Thr Leu Lys He Leu Asn 
210 215 220 

Asn Thr Arg Glu Ala Ala Arg Gly Gin Val Cys He Phe Thr He Gly 
225 230 235 240 

He Gly Asn Asp' Val Asp Phe Arg Leu Leu Glu Lys Leu Ser Leu Glu 
245 250 255 

Asn Cys Gly Leu Thr Arg Arg Val His Glu Glu Glu Asp Ala Gly Ser 
260 265 270 

Gin Leu He Gly Phe Tyr Asp Giu lie Arg Thr Pro Leu Leu Ser Asp 
275 280 235 

He Arg He Asp Tyr Pro Pro Ser Ser Val Val Gin Ala Thr Lys Thr 
290 ' 295 300 



Leu Phe Pro Asn Tyr Phe Asn Gly 
305 310 



Leu Val Asp Arg Lys Leu Asp His 
325 



Asn Ser Lys Lys Phe He He Lsu 



Ser Glu He He He Ala Gly Lys 
315 320 

Leu His val Glu Val Thr Ala Ser 
330 335 

Lys Thr Asp Val Pro Val Arg Pro 
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340 345 350 

• Gin Lys Ala Gly Lys Asp Val Thr Gly Ser Pro Arg Pro Gly Gly Asp 
355 . 360 * . 365 

Gly Glu Gly Asp Xaa Asn Hie He Clu Arg Leu Trp Ser Tyr Leu Thr 
370 375 380 

Thr Lys Glu Leu Leu* Ser Ser Trp Leu Gin Ser Asp Asp Glu Pro Glu 
385 390 395 . 400 

Lys Glu Arg Leu Arg Gin Ary Ala Gin Ala Leu Ala Val Ser Tyr Arg 
405 ' 410 415 

Phe Leu Thr Pro Phe Thr Ser Met Lys Leu Arg Gly Pro Val Pro Arg 
420 ' 425 430 

Met Asp Gly Leu Glu Glu Ala His Gly Met Ser Ala Ala Met Gly Pro 
435 -440 445 

Glu Pro Val Val Gin Ser Val Arg Gly Ala Gly Thr Gin Pro Gly Pro 
450 45S 460 

Leu Leu Lys Lys Pro Tyr Gin Pro Arg He Lys He Ser Lys Thr Ser 
465 470 475 480 

Val Asp Gly Asp Pro His Phe Val Val Asp. Phe Pro Leu ser Arg Leu 
485 490 495 

Thr Val Cys Phe Ash lie Asp Gly Gin Pro Gly Asp lie Leu Arg F Leu 
500 505 510 

Val' ser Asp His Arg Asp Ser Gly Val Thr Val Asn Gly Glu Leu He 
■ 515 520 525 

Gly Ala Pro Ala Pro Pro Asn Gly His Lys Lys Gin. Arg Thr Tyx Leu 
530 535 540 

Arg Thr lie Thr He Leu He Asn Lys Pro Ciu Arg Ser Tyr Leu Glu 
545 550 555 560 

lie Thr Pro Ser Arg Val He Leu Asp Gly Gly Asp Arg Leu Val Leu 
565 570 575 

Pro Cys Asn Gin Ser Val Val Val Gly Ser trp Gly Leu Glu Val Ser 
580 585 590 

Val ser Ala Asn Ala Asn Val Thr Val Thr He Gin Gly Ser He Ala 
595 SQ0 605 

Phe Val He Leu lie His Leu Tyr Lys Lys Pro Ala Pro Phe Gin Arg 
610 . 61^ 620 

His His Leu Gly Phe Tyr lie Ala Asn Ser Glu Gly Leu Sier Ser Asn 
.625 630 635 640 

Cys His Gly Leu Leu Gly Gin Phe Leu Asn Gin Asp Ala Arg Leu Thr 
; 645 650 • 655 
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Glu Asp Pro Ala Gly 
660 

Gin Val Gly Glu Gly 
675 

Val Pro Val Val Trp 
690 

Xaa As s Cys Trp Phe 
705 



Pro Ser Gin Asn Leu 

• 665 

Pro Glu Ala Val Leu 
680 

Lys Gin Arg Lys lie 
695 

Ala Arg Asn Met Pro 
710 



Thr His Pro Leu Leu Leu 
670 



Thr Val Lys Gly His Gin 
685 



Tyr Asn Gly Glu Glu Gin 
700 



Pro Asn 
71S 



<210> 360 
<211> 387 
<212> PRT. 
<213> Homo sapiens 

<"400> 360 

Pro Arg Val Arg Ser lie Lys Val Thr Glu Leu Lys Gly Leu Ala Asn 

1 .5 10 IS 

His val Val Val Gly Ser Val Ser Cys Glu Thr Lys Asp Leu Phe Ala 
20 25 30 

Ala Leu Pro Gin Val Val Ala Val Asp lie Asn Asp Leu Gly Thr lie 
35 40 45 



Lys Leu Ser Leu Glu Val Thr Trp Ser Pro Phe Asp Lys Asp Asp Gin 

50 55 60 

Pro Ser Ala' Ala Ser Ser Val Asn Lys Ala Ser Thr Val Thr Lys Arg 

65 70 75 60 

Phe Ser Thr Tyr Ser Gin Ser Pro Pro Asp Thr Pro Ser Leu Arg Glu 

85 90 95 

Gin Ala Phe Tyr Asn Met Leu Arg Axg Gin Glu Glu Leu Glu Asn Gly 

100 105 110 

Thr Ala Trp Ser Leu Ser Ser Glu Ser Ser Asp Asp Ser Ser Ser Pro 

115 120 125 

Gin Leu Ser Gly Thr Ala Arg His Ser Pro Ala Pro Arg Pro Leu Val 

130 - 135 140 

Gin Gin Pro Glu Pro Leu Pro lie Gin Val Ala Phe Arg Arg Pro Glu 

145 150 155 160 

Thr Pro Ser Ser Gly Pro Leu Asp Glu Glu Gly Ala Val Ala Pro val 

. 165 ' 170 175 . 

Leu Ala Asn Gly His Ala Pro Tyr Ser Arg Thr Leu Ser His lie Ser 

180 IBS • 190 

Glu Ala Ser Val Asn Ala Ala Leu Ala Clu Ala Ser Val Glu Alo" Val 

195 200 205 , 

Gly Pro Lys Ser Leu Ser Trp Gly Pro Ser Pro Pro Thr His Pro Ala 
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210 



215 



220 



Pro Thr Kis GLy Lys His Pro Ser Pro Val Pro Pro Ala Leu Asp Pro 
225 230 235 240 

Gly His Ser Ala Thr Ser Ser Thr Leu Gly Thr Thr Gly Ser Val Pro 
245 250 255 

Thr Ser Thr Asp Pro Ala Pro Ser Ala His Leu ,Asp Ser Val His Lys 
260 265 270 

Ser Thr Asp Ser Gly Pro Ser Glu Leu Pro Gly Pro Thr His Thr Thr 
275 280 285 

Thr Gly Ser Thr Tyr Ser Ala lie Thr Thr Thr His Ser Ala Pro Ser 
290 . 295 300 

Pro Leu Thr His Thr Thr Thr Gly Scr Thr His Lys Pro He lie Ser 
305 310 315 320 

Thr Leu Thr Thr Thr Gly Pro Thr Leu Asn lie He Gly Pro Val Gin 
325 330 335 

i 

Thr Thr Thr Ser Pro Thr His Thr Met Pro ser Pro Ser Ser His Ser 
340 345 350 

Asn Ser Pro Gl n Tyr Val Asp Phe Cys Ser Ser Val Cys Asp Asn He 
355 360 365 

Phe Val His Tyr Val He Gly He Phe Phe His Thr Leu Tyr Ser Ser 
370 375 380 

Lys Thr Leu 

385 , 



<210> 361 
<211> 260 
<212> PRT 

<213> Homo sapiens 

1 <400> 361 ' * ' 

Pro Arg Val Arg Ser He Lys Val Thr Glu Leu Lys Gly Leu Ala Asn 
15 10 15 

His Val Val Val Gly Ser Val Ser Cys Glu Thr Lys Asp Leu Ph*- Ala' 
20 . ^ 25 30 

Ala Leu Pro Gin Val Val Ala Val Asp He Asn Asp Leu Gly Thr He, 
' ■ ■ 35 40 " 45 

Lys Leu Ser Leu G-tu Val Thr Trp Ser Pro Phe Asp Lys Asp Asp Glh 
50 55 60 

Pro Ser Ala Ala Ser Ser Val Asn Lys Ala Ser Thr Val Thr Lys Arg 
65 70 75 80 

Phe Ser Thr Tyr Ser Gin Ser Pro Pro Asp Thr Pro Ser Leu Arg Glu 



95 



90 



95 
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Gin Ala Phe Tyr Asn Mot Leu Arg Arg Glr. Glu Glu Leu Glu 
100 105 110 



Asn Gly 

Ser ser Glu Ser Ser Asp Asp Ser Ser Ser Pro 



Thr Ala Trp Ser Leu 

li5 120. 

Gin Leu Ser Gly Thr Ala Arg Hie 
130 135 

Gin Gin Pro Glu Pro Leu Pro He 
145 150 

Thr Pro Ser Ser Gly Pre Leu Asp 
165 

Leu Ala Asn Gly His Ala Pro Tyr 
lflO 

Glu Ala Ser Vai Asn ALa Ala Leu 
195 200 

Gly Pro Lys- Ser Leu Ser Trp Gly 
210 215 

Pro Thr His Gly Lys His Pro Ser 
225 230 

Gly His Ser Ala Thr Ser Ser Thr 
245 

Thr Ser Thr Asp 
260 



125 

Ser Pro. Ala Pro Arg Pro Leu Val 
140 

Gin Val Ala Phe Arg Arg Pro Glu 
155 160 

Glu Glu Gly Ala Val Ala Pro Val 
170 175 

Ser Arg Thr Leu Ser His lie Ser 
185 ^ 190 

Ala Glu Ala Ser Val Glu Ala Vai 
205 

Pro Ser Pro Pro Thr His Pro Ala 
220 

Pro Val Pro Pro Ala Lea Asp Pro 
235 240 

Leu Gly Thr Thr Gly Ser Val Pro 
250 255 



<210> 362 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<400> 362 

Tyr Gly Cys Glu Lys Thr Thr Glu Gly Gly Arg Arg Arg Arg Arg Arg 
1 5 10 15 

Met Glu Ala Val Val Phe Val Phe Ser Leu Leu Asp Cys Cys Ala Leu 
20 25 30 

lie Phe Leu Ser Val Tyr Phe He He Thr Leu Ser Asp Leu Glu Cys 
35 .40 45 

Asp Tyr Tie Asn Ala Arg Ser Cys Cys Ser Lys Leu Asn Lys Trp Val 
50 55 60 

He Pro Glu Leu He Gly His Thr lie Val Thr Val Leu Leu Leu Met . 
65 70 75 80 



Ser Leu His Trp Phe 
85 



He Phe Leu' 



Leu Asn Leu Pro Val Ala 
90 



Thr Trp 
95 
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Asn lie Tyr Arg Tyr lie Met Val Pro Ser Gly Asn Met Gly Val Phe 
1O0 105 110 

Asp Pro Thr Glu lie His Asn Arg Gly Gin Leu Lys Ser His Met Lys 
115 120 125 

Glu Ala Met lie Lys Leu Gly Phe His Leu Leu Cys Phe Phe Met Tyr 
130 135 140 



Leu Tyr Ser Met lie Leu Ala Leu lie Asn Asp 
145 150 155 



<210> 353 
<211> 70 
<212> PRT 

<213> Homo sapiens 



<400> 363 

Ala Art? Ala Pro Ala Pro Ser Leu 
1 5 

Ala Leu Ala Pro Ala His Ser Leu 
20 

Ser Gly Ser Ser Leu Pro Ser Ala 

35 40 



Pro' Pro Leu Pro Ser Pro Ala Pro 

10 ( 15 . 

Leu Gly Leu Leu Leu Gly Arg Met 
25 30 

Leu Ala Leu Ser Leu Leu Leu Val 
45 



Ser Gly Ser Leu' Leu Pro Gly Pro Gly Ala Ala Gin Asn Val Arg Val 
50 55 60 

Gin Ser Gly Gin Asp Gin 
65 70 



<210> 364 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 364 

Gly Thr Ser Lys Asp Cys Val Leu Tyr Ala Phe Leu Asp Pro Gly Met 
1 5 10 15 

Ala Val Pro Leu Phe Leu' Tyr lie Phe Thr Leu Leu Pro Leu Leu Pro 

20 25 . , 30 

Phe Leu Leu Ser Leu Cys Phe Ser Pro Leu Thr val Lys Arg ser Ser 
35 40 45 

Ser Ser Glu Ser Lys Ser Ser Leu 

50 ' 55 ' ■ 
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B. IDENTIFICATION OF DEPOSIT 



Furtherdeposit5areidcnuHeiJonanadduionaJsheet [ [ 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (incltxiiitf posiai code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States ot America 
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Juty27, 1998 
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203069 
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Europe 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism wiii be made available until the publication of the mention ot the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
tne issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 
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CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
appUcation or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of die deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on die list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest m the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be 'filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests that, until the application lias been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

the applicant hereby requests that, until the application lias been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS ~ 

The applicant hereby requests that until the dale of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 3 1F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
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A. The indications mode below rclnteto the microorganism referred to inrhcdeschpcion 
onpage 2*3 Hnc WA 

B. IDENTIFICATIONOFDEPOSIT Further deposits are identified on an additional sheet |""| 

Namedf depositary institution American Type Culture Collection 



Address of depositary institution litubJing postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit 




Accession Number 






June 11, 1098 




209965 



C ADprriONALINDICATIONS^fi7wA^i|fi«cppiice6/f* This information is co rrt inued o n an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications an not for aildttignaud Stales) 



Europe 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issue of such a sample to an expert nominated by the person requesting the sample {Rule 28(4) EPC). 



E. SEPARATE FURNISHING OF INDI CATIONS (IcavtbtankifnotappiieabU) 



The indications listed below will be submitted to the International Bureau later i^cifyihtfrnt^nantnot'iheindicantme.g, 
" — of Deposit") 



For recei ving Office use only 




For Internationa I Bureau use only 



["1 This sheet was received by the international Bureau on: 



Authorized officer 



Form PCnVRQ/134 (July 1992) 
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CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in. the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only he effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention - 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public ' 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National' Board of Patents and Registration withouUiaving been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert- The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, die furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with Lhe International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1 ) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule libis) 



A. The indications nude below relate to the microorganism referred to in the description 
onpage 249 !ine . NM 



B. IDENTIFICATION OFDEPOSIT Further deposits are identified an on additionaJ sheet [""[ 



Name of depositary institution American Type Culture Collection 



'Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date or deposit 

June 26, 1998 



Accession Number 

203027 



C ^tornONALlNDlCAllOTXSUtweoiaxkiftwtcppiicable) This information is conn nued on an tddioonai sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications ore not for all designated Stales) 



Europe 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 



E. SEPARATE FURNISHING OF INDICATIONS/ Uavtblankifnotapplicablel 



The indications listed, below will be submitted to the International Bureau \^T(rpec^thegmcntlna^f^^indkxsuxat.^., "Acctssica 
liimber of Deposit "f 



For receiving Office use only 




For International Bureau use only 



[~~1 This sheet Was received by the International Bureau on: 



■^Authorized officer 



Form PCT/RO/IM (July 1992) 
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CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY * 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of Ihe Australian Patents Regulations). 

FINLAND ' - 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request (o this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that al the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third pany for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 3 I F(l) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
puhlic under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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Applicant's or agent s file ™^ n^.T I International applicanoc 

reference number PZ031PCT | Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule )3bis) 



A. The indications mace be Jow relate to the rma-oargaru.xm referred to 
onpage 253 i ine: 


n the description 
N/A 




a. roENTmcATiONOFDEPosrr 


Further deposits are identified on an additional she 





Nameof depositary institution American Type Culture Collection 



Address of depositary institution (inciudine postal code and country) 

10801 University Boulevard 
Manassas. Virginia 20110-2209 
United States of America 



Date of deposit 




Accession Number 






Jury 27. 1898 




203071 



C ADDITIONAL tKOlCATlONS{ leave blank not apptkabtet This information is continuedon an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MAD E / tfthe indications are not for <dl designated Satetf 
Europe 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, onty by 
the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 



E* SEPARATE FURNISHING OF INDICATIONS Ueave blavkif not applicable t 

The indications listed below wilt be submitted to the International Bureau later r jp«yv the general nanae of (he uuhavwns tg, "Atzenxn 
Number of Deposu") 



^^Tris sheet was received with the international application 



For receiving Office use only 



Authorized officer 




Forlntemaaonai Bureau use only 



f~~l This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT7RO/I34 (July 1992) 
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CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

the applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent* 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If sUch a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

Ttie applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert! The request to this effect must be filed by the applicant with * 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests thai, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/1 34 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request lias been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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Applicant's or agent's file 
reference number 



PZ031PCT 



International a pp ticarior 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc llbis) 



A. The indications made be low iclait: u>thcnvaborgara»nrcferiedtoinihcilei^npuun 
on page ?S9 .line WA 



B. IDENnnCATiOVOFDEPOSIT FunJiefdepositsaretdenufiedonanaddirionaJtheet | | 



Name of depositary institution American Type Culture Collection 



Address of depository institution (including postal ixxie and countrj) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit 

July 27, 1998 



Accession Number 

203070 



C ADD1TlQNALlNDlCATlONS(lea\vbtartifrimappiicablii Thisiafbnnrooniscontinuedon^ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for ail designated Slates) 



Europe 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until trie publication of the mention of the grant of the European patent 
or until the data on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issue of such a sample to an ex pen nominated by the person requesting the sample (Rule 28(4) EPC). 



E. SEPARATE FURNISHINO OF INDICATIONS r'mw Wtuut i//iafappferf)//> 



The indications listed below will be submitted to the International Bureau later (iyecfyiheteneirdncmrtoftherdxaiimsc^ 'Accession 
Number of Deposit") 



For receiving Office useohly 



This sheet was recei ved with 1 



Authorized officer 




laiional application 



imms . ^ 



Foe [ntema&onaJ Bureau use only 



1 | This sheet was received by the International Bureau on: 



Authorized officer 



fwmPCI7RCVlJ4(July 1992) 
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CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent. " 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), of has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the' furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected, to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands . 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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